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ON THE UNBIASED CHARACTER OF LIKELIHOOD-RATIO TESTS 
FOR INDEPENDENCE IN NORMAL SYSTEMS 


By’ Joserpy F. Daty 


1. Introduction. In the statistical interpretation of experimental data, the 
basic assumption is, of course, that we are dealing with a sample from a statistical 
population, the elements of which are characterized by the values of a number of 
random variables z’, --- , 2°. But in many cases we are in a position to assume 
even more, namely, that the population has an elementary probability law 
f(z’, ---, 2"; 01, +--+, 6s), where the functional form of f(z, 0) is definitely 
specified, although the parameters 6, , --- , 0, are to be left free for the moment 
to have values corresponding to any point of a set 2 in an h-dimensional space. 

Under this assumption, the problem of obtaining from the data further infor- 
mation about the hypothetical distribution law f(z, 6) is considerably simplified. 
For it is then equivalent to that of deciding whether or not the data support the 
hypothesis that the population values of the 6’s correspond to a point in a certain 
subset w of 2. For example, we may have reason to believe that the population 
K has a distribution law of the form 


4 —} y A4j(z'—2a*) (zi—a?) 
a a A;; é 
f(a, 2°03, a, An, Au, An) = Aa!’ ” 


Here the set 2 is composed of all parameter points (a’, --- , Age) for which the 
matrix || A;; || (z,7 = 1, 2) is positive definite and for which — © <a’ < o. 
We may wish to decide, on the basis of N independent observations (2) , 2%) 
drawn from K, whether Ai: has the value zero for the population in question, 
without concerning ourselves at all about the values of the remaining param- 
eters; in other words, we may wish to test the hypothesis H that the parameter 
point corresponding to K lies in that subset of 2 for which Aiz2 = 0. One way to 
test this hypothesis is to select some (measurable) function g(x) whose value can 
be determined from the data, say 


> (et — 2) (22, — #) 


OF ee l[za—=] 


Now g(2) is itself a random variable, so that it has a distribution law of its own 

when its constituent x’s are drawn from any particular population K. Suppose 

then we choose a set of values of g(x), say S, such that the probability is only .05 

that g(x) will lie in the set S when the z’s are drawn independently from a 

population K for which the above hypothesis H is true. Ordinarily we would 
1 
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take S to be of the form | g(x) | > go , and the test would then reject H at the .05 
probability level if the computed value of g(x) came out too large. But for all 
that has been said so far, we are perfectly free to choose a different critical 
region S, and even a different function g(x). The essential elements of this 
type of test are then a critical region S, a function of the data g, and a probability 
level ¢, such that the probability is « = .05, say, that g C S when A is true; in 
employing the test we reject H at the given probability level whenever the 
sample value of g falls in the critical region. 

By the very nature of the problem, any inferences we make from a sample are 
subject to possible error. Ih the kind of test under consideration, the only error 
we can commit, strictly speaking, is that of rejecting H when it is true (an error of 
Type I in the terminology of Neyman and Pearson [9]). The risk of such an 
error is thus known in advance; for if we use the test consistently at, say, the .05 
level, we know that the probability is .05 that we shall be led to reject a given 
hypothesis when it is true. On the other hand, it is quite conceivable that the 
test may be even less likely to reject H when it is false, or more precisely, when 
the true 6’s correspond to a point of 2 which is not inw. In this event the test is 
said to be biased. Let us make this term more definite by proposing the follow- 
ing definitions: 

DEFINITION I. A test ts said to be completely unbiased if it has the property 
that for any probability level « (0 < € < 1) the probability of rejecting H is greater 
when the @’s correspond to a point of 2 — w than when they correspond to a point of w. 

DEFINITION II. A test is said to be locally unbiased 2f the set Q contains a 
neighborhood U of w such that for any probability level « (0 < € < 1) the probability 
of rejecting H 1s greater when the parameter values correspond to a point of U — w 
than when they correspond to a point of w. 

It is the purpose of this paper to consider the question of bias in connection 
with the Neyman-Pearson method of likelihood ratios [8] as applied to the 
testing of what may well be called hypotheses of independence in multivariate 
normal populations. The likelihood ratio method is undoubtedly a very familiar 
one, since the vast majority of tests in present statistical practice are based on 
this method. But for the sake of completeness we shall outline it briefly. Let 
the distribution law of the population K be of the form Oe, +++, 8 th, +> 
where the 6’s may correspond to any point in a set Q, and let the hypothesis H 
to be tested be that the 6’s actually belong to the subset w of 2. Form the 
likelihood function 


N 
Py(zx; 6) = size, evga, ~++, On) 


i.e., the elementary probability law of a sample of N elements drawn inde- 
pendently from K. Denote by Py(zr) the maximum of Py for fixed x where the 
6’s are allowed to range over 2; and denote by Px (zx) the corresponding maximum 
value when the @’s are restricted tow. The test criterion is then 


_ F(z) 
P* (2). 
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Evidently \ depends only on the observable quantities z,, and has the range 
0 <A <1, with a definite probability law depending on that of the basic popula- 
tion K. In this method the critical region S is taken to be 0 < A < A. , where 
d. is so chosen that the probability P{A < \,} is « when the parameters of K 
correspond to a point inw. (It may be noted here that in all the cases with 
which we shall have to deal the probability that \ lies in S when H is true is ° 
independent of the particular values of the 6’s as long as they correspond to a 
point of w.) The reason for taking the critical region to be of the form0 <A < 
d,. and not, say, N<A< Mork <A<1 may become clearer when we examine 
the resulting tests for bias. 

The recent work of Neyman and Pearson [10] has led them to lay considerable 
stress on the importance of unbiased tests. And though their attention has been 
directed mainly to the broader outlines of the theory of testing hypotheses, 
they have stimulated other writers to study particular tests of great practical 
importance. P.C. Tang [11] has obtained the general sampling distribution of 
1 — d”” for what we shall call the regression problem with one dependent variate, 
and has given tables for P{A < A,}—essentially proving the unbiased character 
of the test—which should be extremely useful. His article also contains an 
excellent discussion of the manner in which this test is related to the well known 
tests of linear hypotheses [7] and to the ordinary analysis of variance. P. L. 
Hsu [6] has shown that this same distribution is fundamental in the study of 
Hotelling’s generalized 7’ test [5] (a special but important case of what we shall 
call the general regression problem), and has. proved that (locally) this test is 
not only unbiased but “‘most powerful” in a certain sense. On the other hand, 
it is not true that all likelihood ratio tests are unbiased [2]. Consequently, the 
knowledge that in a rather wide class of problems which arise in normal sampling 
theory the method of likelihood ratios furnishes tests which are either locally or 
completely unbiased would seem to be of some value, even when the exact 
sampling distribution of the criterion is too complicated to tabulate. 


2. The regression problem with one dependent variate. Suppose that y is 
known to be normally distributed about a linear function of the fixed variables 
x, ---, 2’, 80 that the family of populations under consideration is characterized 
by a distribution function of the form 


ake yD bz ; 
(2.1) fly|z, b, a”) a (290) *e 202 i=1 ) > 


where the set of admissible values of o” and the b’s is 
Q20<07 <w, -x7 <i < om. 


Let H be the hypothesis that the point (0°, b;, --- , by) lies in the subset of 2 
defined by 


@: bet = baie = coco b, = 0. 
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The likelihood ratio appropriate to testing the hypothesis H on the basis of N 
(N > r) independent observations drawn from such a population is then 


7 & (- Eh) " 
X (ve — 2 bts!) 


a=1 


, : : 1 
with the understanding that the values of the fixed variables xz, , ---, 2% asso- 
ciated with the a-th observation have been so chosen that the matrix || a’’ || = 


=. LX is positive definite. (The expression in the numerator is the mini- 


|| asl 
N 

mum of >» (v. . bix 5 for variations of the b’s over Q, while the denomina- 
a=1 


tor contains the mein minimum for variations of the b’s over w). 
In order to show that the test is unbiased, we shall make use of the exact 
sampling distribution of the quantity 


E=1—)", 


first published by P. C. Tang [11]. Writing || A,, || for the inverse of the 
matrix || a” || composed of the first ¢ rows and columns of || a’? || 


|| , let us put 
r q 
jan. = Gs a a! Ana) baby. 


20? k,i=qt1 gh=l 


Since the critical region 0 < \ < \, corresponds to the region 1 — Az’ = ¢. < 
& < 1, it can then be shown that the probability of rejecting H when the popula- 
tion parameters have specified values a’, b’, -- - , b’ is expressed by the series 


0 Gg’ 1 pers tt pint 
yao v! Je, BIZ(r — g) +», 3(N — 7)] 


ae r(u)T(v) ae . u—1 saat v—1 
B(u, v =e = fe (1 — z)” “dz. 


Now G is a positive definite quadratic form in the parameters b*’, --- , b’, so 
that it vanishes if and only if the hypothesis is true. And if 0 < e < 1, then 
I(G, &) is a monotone increasing function of G. For by differentiating (2.2) 


we obtain 
, fs Gg’ a i som 4(N—r)—1 
1G, &) = “oS fa a 
=o v! Je, \BIR@ — g) + +1, 3(N — 7)] 
foerga — : a | hae. 
~ BE — g) +», HN — 1) 

And from a property of incomplete Beta functions, which we shall demonstrate 
in the next section, it follows that each term in the series (2.3) is positive. Ac- 
cordingly we have 


(2.2) 1(G, &) = «? dé, ' 


where 








ia 
(2.3) 
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THEOREM I. The likelihood ratio test for the hypothesis that in a population of 


type (2.1) certain of the regression coefficients are zero, 7.e., the hypothesis that y is 
independent of the fixed variables x*™", ..- , x", is completely unbiased. 


Wilks [15] has noted that the ordinary analysis of variance and covariance 
amounts essentially to testing hypotheses of this nature by means of the function 


1—)7/*% 
f= —\aiw 


Consequently such tests are also completely unbiased, since the region of rejec- 
tion is then taken to be of the form ¢ > & . 


3. An inequality relating to incomplete Beta functions. Let us write 
1 
B(u, v5) = f ea — 2)" tae (0 <t<1). 
t 


Now, 


1 u v1 1 
/ 2’ (1 —2)’dz = fa-e) + -f z“(1 — 2)” ‘dz. 
t U U Jt 


t 
The integrated term on the right is non-positive, so that 


(3.1) B(u, » + 1; 1) <~ Bw + 1,950) 


in which the equality holds if and only iff = Oort = 1. Again, since 
2"(1 i z)” + 2“ (1 = z)” = 2" (1 pre 2)”, 





we have 

(3.2) B(u + 1, v; t) + B(u,v + 1; ¢) = Bu, »; 2). 
Combining these results, we find that 

(3.3) : * ” B(u + 1,0; 2) > Blu, v;2) 


with equality only when ¢ = Oort = 1. Hence we have 
Lemma 1: If0 <t < 1, then 
Bu + 1, v5) | Bu, »; t) 
B(u + 1, v) B(u, v) © 


4. The multiple correlation coefficient. Suppose the distribution law of the 
underlying population is known to be of the form 


de _—s— 
(4.1) f(z’, ‘tay x |a** ven, x”) _ | - e B;j(z*—a CpzP) (a? al cue) 
The indices appearing in this expression take the values 7, 7 = 1, --- , ¢ and 


p,q =t+41,---,m. The summation convention of repeated indices will be 
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used, forexample, >, C'‘,z” will be denoted by C‘,2”. We shall also have occa- 
p=t+1 ; 


sion to use indices r, s with the ranger, s = 1,---,m. The set of possible values 
of the a’s, B’s, and C’s is 


Q: || Bi; || positive definite; — 0 <a’ < 7;-0 <Ci < ~. 


We shall consider the \ test for the hypothesis H that 2’ is independent of the 
remaining variables x’, --- , x”, i.e., that the parameters belong to that subset of 
Q defined by 


wo: Be=0, (k=2,---,0; CL=0. 
N 
Let us write vo = >> (2%, — 2")(x’, — 2’), and assume that the values of the 


a=1 
fixed variables r@ have been so selected that the matrix || v”* || is positive defi- 
nite. The likelihood ratio can then be expressed in the form 


a= (in y" =(1— RY), 


vd 





where 3, is the complement of v" in the determinant |v" |. If N > m + 1, 
the general sampling distribution of R’ (the multiple correlation coefficient 
between x’ and m — 1 other variates), for this case in which 2’, --- , 2‘ are sub- 
ject to sampling variation and the remainder are fixed, is 


(1 w ptr) ew? oe ree aye 











F(R’) d(R’ a 
(4.2) oe r[3(N — m)] 
Sy) — p)*(0)" (RYO P(N — 1) ta tol ooo 
” 2 x wv !IT[Z(N — 1) + wll [3(m — 1) + 4+ 9] re, 
where 
Bi; pg “ Ps 
p= ptm Pel yt = Fiesct, BIN = Uh Bul 


This distribution was first obtained by Wilks [13], although Fisher [3] had 
previously treated the two extreme cases in which (1) all independent variables 
are subject to sampling fluctuation, and (2) all independent variables are fixed. 

To simplify the presentation, let us put p = p’, 9 = }y’ and R = R’, and note 
that 9 = Oif and only if C, = 0 (p = t +1, ---, m) while p = Oif and only if 
By = 0 (k = 2, --- , t), so that 7 = p = O means that the hypothesis H is true. 
On any alternative hypothesis, one or the other or both of these quantities will 
be positive. Let the region of rejection be taken to be 

R.< R <1, 


which corresponds to 


Oo<rv<(1— BR)”. 


mee 
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The probability of rejecting H is then 





u=0 »=0 Mt v! TiR(N — 1) + 4] 
1 pore, .. Rie- m)—1 
* fer 1) + p + ®, 4(N — m)] 


We shall show that J(j, 9, R.) is a strictly monotone increasing function of A 
for each g, and that J(0, 9, R.) is a strictly monotone increasing function of 7. 


1(, 9, R.) = ay > Za- per 1)+uP "THN — 1) +4 +9) 
(4.4) 
dR. 





al 
First consider ap . We can write (4.4) in the form 


af 1 
™ 7, R. is v 7 p 1 — 5/8" re i 
I (A, 9, ) é os hi ria(N —1) + ul a p) Pur» 
where 


Bl3(m — 1) + y+, 3(N — m); Rj 
,=TE(N -1 the Sn od 
too = TH — 1) +o tol — 1) +n + Oa — wl 
Then, formally, 


G(s PB ~\i(N—1)+ ) 
=. © OM sn ow 
(x 4a - ae, 
—l 0 oe? 
= 2-0 — ern — BE — i — 1) + alone 
Taking out the factor (1 — p)**~?**, we have left 


nl oo v oo v 

vp. vp p 
~ Pu -> — an z i [3(N ‘al 1) + Bgy,» 

vl v=0 y! y= y! 


v=( 
= fous — RIN — 1) + + rey. 
v=o vt 


And the expression ¢,,»41 — [3(N — 1) + » + vle,, is the same as 


1 fBlz(m — 1) +ut+v+1,3(N — m), Ri) 
ra — 1) +e+%+1)\ SoG — 1) +a tot 1, iW — ml 


_ Bi}(m — 1) +u +, 3(N — m), Rl 
B[}(m — 1) +» +», 3(N — m)] J 


and is therefore positive, by Lemma 1. Consequently 





a 
apt y; R.) 2 0, 


with equality holding only if p = 1, or if the critical region is taken as the whole 
interval or the null set. 
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0 ™ 
We have yet to investigate ag I(0, 9, R.). In this case (4.4) becomes 


— 4 7 Bid(m — 1) + uw, H(N — m), RB 
4.5 1(0, 9, B) = 6% > BG ee, 
— 0,9, Be) = eT BihGa — 1) +a, AV mo] 

(Note that this agrees with (2.2) if we make use of the relations r = m, g = 1, 
and B" = 20°.) We then obtain 


o 5, gy [Bm —1) +u4+1,3(N —m); Ri) 
100, 9, R) = €* 2 "ae =I) +u4+1,4W—m) 
B[3(m — 1) + un, 3(N — m)] 


which the lemma shows to be positive when 0 < R, < 1. 

This concludes the proof of 

THEoREM II. Jf the underlying population has a distribution law of the form 
(4.1), then the likelihood ratio test for the hypothesis that x' is independent of x”, --- , 
a”, where x‘, .-- , 2” are fixed and x’, --- , x‘ are subject to sampling variation, 
as completely unbiased. 


5. Mutual independence of several sets of random variables.’ Let the dis- 
tribution law of the m-variate population be of the form 


B:; | a 
6.1) Bal prsteienite 


Here @ is the set || Bj; || positive definite; — © <a’ < «©. Suppose we wish to 


test the hypothesis H; that the variates {z',---,2™"},---, {2 --. a} 
are mutually independent in sets [14], where 0 = m < m < --- < mp, = Mm. 
Then the w set is that defined by 

I Bai ll = ll Baa I] 4 --- + Bas, | = |] Ball + --- +1 Boll, 


that is, we have B;; = 0 unless the indices 7 and j both relate to the same set of 
variates. 

Associated with the population of random samples Oy (NV > m + 1) drawn 
from a universe characterized by (5.1), we have the distribution function 


N 
; ww By j(zt—at) (zi—a/) 
Ar errr 





P(z; B,a) = 


ainm 
The maximum of P with respect to variations of the parameters B;; , a’ in Q is 


7 ivm 
pe = pn (XY em 


= —_ » 


1 In this and in subsequent sections an index occurring both above and below indicates 
summation in accordance with the usual convention. 
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where 


. 
v= Di (ai — a)(xi — 2’). 
a=1 
And the maximum when the parameters are restricted to w is 


r\3Nm 
P., = [v, --- vel (x) eo. 
e Tv 


where v, stands for the determinant of the v’s connected with the yu-th set of 2’s. 
Thus the appropriate likelihood-ratio is given by 
2/8 = |v"? | 
. Vi eee Up 


It is easy to see that the value of \, is unaltered if we replace z* — a‘ by 2’, 
so that we can express the probability that A, will lie between 0 and 4, in the form 


N 
i ad seated 
B** 2 Bj jt37} ‘ 
or dz, -+- dzx. 
A<A, 


I(B, A.) _ zivm 
Furthermore, A; is invariant under the operation of replacing any x by a linear 
combination of z’s belonging to the same set. And since the assumption that 
|| Bi; || is positive definite implies that the matrices || B;,;, || have the same 
property, we can transform. the z’s in each set among themselves by orthogonal 
transformations in such a way as to reduce each of the expressions 


es . . 

25 

Bui, its 
to sums of squares. Thus we have 


N 


BTN on By aja) 
(5.2) I(B, x.) = | .o dz; --- dxzy = I(B*, »,), 
h<rhy 


iNm 
where 
(5.3) Bui, = al Bre, a, (hy, tus Ju» Ky = Ma +1, ---,m,), 
(5.4) Bij, = 0 is ¥ ja, 


and the subscripts on the indices indicate the sets of values over which they 
range; e.g., 7: runs over the numbers corresponding to the columns of the matrix 
|| Be ||. From (5.3) and (5.4) it is clear that || Bj; || reduces to a diagonal 
matrix when H is true.. 

In order to show that the test is locally unbiased, we may consider the deriva- 
tives 





0 a 
* atime * 
ae, I(B*, »))) (sae apes I(B*, »))) (u # v, 0 ¥ 7) 
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for the B*’s are linear functions of the B’s; and the positive definiteness of one 
matrix of second partials implies that of the other. We have at once 


aB* a? B* 
(a ),- 0, &. a) - (u #v,0 ¥ 7) 





tyly tyly 


unless the second derivative is taken twice with respect to the same B*. Thus 


N 
al(B*, d.) Baw { et as 
a aB; ),=- 2 im a, Eatete “ae 


tly 





where the Bo indicates that the B's have the diagonal form associated with H. 
And since whenever the point My svn xi ,+++,%y3-+-+-+,2y is in the region 
\ < , so also is the point 2}, ---; —z1,---, —twj-++, 2m it follows that 





apt, HBS) = 0, (u ¥ v). 


Similar considerations show that the non-repeated second derivatives 


N 
a2 *3N _— N — 2 Bt ;o7i2) 
20a, 18) = 4 J, (Hatet) (Zeavar) oa 
must vanish. 

Finally, we must show that the repeated second derivatives are positive when 
evaluated at a point in w, except of course in the trivial cases 4, = 0, A, = 1, 
when they must be zero. In order to do this, we shall make use of the fact that 
the v’s which go to make up J have the Wishart distribution [17] 


4(N—1) 
(5.5) B 


tt te or 


qiom(m—1)) Il r[3(N _ i)] 
i=1 


(Because of the relation v’ = v”, only 3m(m + 1) of the v’s appear as differen- 
tials). It will be useful to have the notation 


Beep 
A © err rrsereen 
gtirie-)) I T[3(N - ‘)] 
V(B, N —_ 1, m) = yh (N—m)—1 5 Bs 508) 


With the aid of (5.5) we shall now compute the moments 


E((Q””*)'), h=0,1,---, 
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for the case in which the matrix || B;; || has the form 


Bu +++ Bim, O ++ OBin 


0 
oe a 
(5.6) 0... 0 , 
. B 
: || B || 
as. 


where || B || stands for || Bz || + --- + || Bp ||, and all other B’s, except those 
indicated, are zero. Let us designate by (0) the set of v'’ which correspond to 
the rows and columns of B, and by (v — 6) the remaining v’s. We then remark 
that the result of integrating (5.5) with respect to the v’s in (v — 5) is to reduce 
it to the corresponding distribution for the variables in the set 0, thus: 
G(B, N — 1,m) [ V(B,N — 1, m)d(o — 8) 

(5.7) 

= G(B, N — 1,m — m)V(B, N — 1,m—m), 


where || By: || is the inverse of the matrix obtained by inverting || B;; || , and 
striking out the first m; rows and columns, that is 


BY = BY (k,l = m +1, +--+, m). 
Then, 


h 
G(B, N — 1, m) {s*= V(B, N — 1,m)dv — 5) 
r, 


h 
. Up 


can be written as 


G(B, N — 1, m) es - - 
G(B, N — 14 2h, m) OB N — 1+ 2h,m) [us 0805 
(5.8) “xX V(B,N — 1 + 2h, m) d(v — 0) 
oo G(B, N - 1, m) 


- GENT wm) 928 - 1 + 2h, m — m) 


X vy"... vp’ V(B, N — 1+ 2h, m— m). 
It can be seen from (5.6) that 
|| B\| = || Bel] + «+» 41) Boal] + || Boll 


since of all the rows and columns of || B,; || which are involved in || B || it is 
only the last in which a non zero element appears outside of the blocks || Bz || , 


- , || B, ||. Consequently, the v’s corresponding to the determinants v2, --- , 
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v» are independently distributed, so that if in (5.8) we integrate out all the 
remaining v’s but these, we shall be left with a product of factors 


_ G(B,N—1,m)_ 77 GB, N — 14 2h, by) 
G(B,N —1+2h,m) i=: G(B:,N — 1, ki) 
< G(B,, N — 1, kur" V(Bi, N — 1 + 2h, ki) 
G(B,, N — 1 + 2h, ky) 
G(B,, N —— 1, a 





-G(B,, N — 1,k,)v,"V(B,, N — 1 + 2h, kp), 


where k, stands for the order of || B, ||. And this, when integrated with respect 
to the v’s in v2, --+ , vp, yields 


_G(B,N—-1,m) 7 G(B.,N —1+2h,k.) | G(B,, N — 1+ 2h, k,) 


G(B,N—1+ 2h,m) i=:  G(B, N — 1, hi) G(B,,N—1,k,) ’ 
which, because of the definition of the G’s, reduces to 


" TiN — 2) + Al ome _TRW — 7) | —h ph h @h 
I T[R(N — 1)]) too iar T[R(N — 2) + A] x BB: «++ BpaBy. 


Denoting the product of ratios of I’s by K;, , and recalling the form of || B;; ||, 
we therefore have ; 


h 
(5.9) El 3 #| = K,B\ Be 





Vg +++ Up 
with 





ie Be Oo Oy 
0 

















| 
| 








se - & | 
QO .-.. 0 | 
| | 














|| B’ || = 
: B 
4 I| Bo || 
| Bn 0 eee 0 


But it is not difficult to see that under the condition (5.6), the matrix || B, || 
is also the inverse of the matrix obtained by striking out the first m rows and 
columns in the inverse of || B’ ||. Making use of this relation, we can apply the 
Jacobi theorem to (5.9), and put that expression in the form 


yh 
E| ze | = K,B;", 
Vg +++ Up 


AE TTI IE NE ELT NL eR PSE metre 


where || B, || is the matrix in the upper left hand corner of || B’ || , namely 
| Bis |l- 








rE 
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Let the subscript 8 on a B stand for the result of replacing B;,;, by Bi,;, + 








8:,;,- For sufficiently small values of the 6’s the matrix || Bi; || will still be 
positive definite, so that we shall have 
4(N—1) vy" a 
a = * an | y.eee yiN—m—t eBii8e'! dy = Ky, Bip, 
o D ewe 
gin 1)) I r[i(N a i)] P 
which we can put in the form 
vr ae K 
’ es ght N—M)—1 Bs ggu*? = h 
(5.10) = | Ge +s 2” : dv Bi, BIO: 


Wilks [13] has shown how to generate moments of determinants by the device 
of replacing 8;,;, by 6:,;, + £:,£;,, and integrating with respect to the ¢’s from 
—«to«. Applying this process 2h times to the left hand side of (5.10) gives 


h 
en’ [ (- : ) V(Bs, N — 1, m) dp, 





1°°° Up 
which when multiplied by 7 ‘“B!~” yields 
E{Q?’*)] 


when the §’s are set equal to zero. 


To obtain the value of this expression, we may perform the same operations 
on the right hand side of (5.10). But before so doing, we shall put Bs, in a 
more convenient form. We have 


Bs = Big-B — Bin- BB””. Big, 


where B”” is the inverse element of Bmm in || B || , and Byj' is the cofactor of 
Bug in Byg, the result being obtained by expanding Bz, according to minors of 
the first row and first column. Similarly, 


(5.11) B = B,-B — Bi,- BB". By". 
From (5.11) we have 
B ‘ll 
1 


we = 2 mm Xl 
ik Bim B BB’ 


so that if we put B- By’... B>' = A, we find that 


411 AU 
Buy-BY1 a Bia Ben Pa, BB 
B, Bl By 


B By 
By - pul — A) zt 


Bs 
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Thus the result of multiplying (5.10) through by B'’~” (where no ’s are 
substituted in this determinant) can be put in the form 


B R(N- 4) ae —4(N-1) - 
_— (5) {1 gm A) eet Bil 


Expanding the expression in curled brackets, we get 


Bi; 


4(N-)D pi(n—1) = T[2(N — 1) +9] 4] —[4(N—-1) +h+y] » 
aon ate Sua — ot gn) Balen — a 


| 
| 
| 


If we let Big, stand for the result of replacing By, by By, — tin Big , we can write 


this as 


ion sp TEV — 1) + 
(5.13) =o vIT[3(N — 1)] 





| (1 7 A)’ (BY) “* gite-t+ 


Ti3}(N—1) +h] 2” Byiter-P +4 


* TN —1) Fh +0) 





the derivatives being evaluated at t = 0. 


Now Wilks’ results show that the operation of introducing §;,;, + &:,£;, into | 


Big: to replace 8;,;, and integrating with respect to the ¢’s, when repeated 2h 
times on Big”? ™ , produces 


amt By? 1) rz TIZ(V = i)] 
i=1 ris 1(N — 1) + - hl 





when the §’s are finally set equal to zero. Reversing the order of summation, 
differentiation and integration in (5.13), we thus obtain 





oo ft TBO) gr $$ PBO-) +2 
(5.14) int I[3(N — 2) + h] y=0 vIT[Z(N — 1)] 

_ My» B—D+y r[3(N — 1) +A] ( —1N- 4 
Now 


-uw—-p\ _ THN — 1) +9) pony p—-aw- D+ 
(ai ), ~ TRI — 0] ay Ri 


so that (5.14) becomes 


it TERN — 4) + A] = vIT[A(N — 1)] 
a— ay BW - D+ ra -1) +4) 


TT BW =a) orn $F TH — DI 





r3N-—1)+h+>] TRW-)) 


paige Dem Ms SARE SV RRRIDS, HSE 


PR ae MERE 1 BEG 


SEITE RY LED 


TOR SCION 


RET ih ASTIN PORN BS 








are § 


rite 


)+h] 


nto 





' 
: 
- 
4 


EEO A NI 


SALA IOL LILY LIN SS TER, BELEN ON GMT ETERS YSN 
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From this it appears that the h-th moment of d7/” is given by 


m wi _. oa iy 
El(?*)"] - II r[2(N t) + Al T[R(N — 3)] 





onl TERN — 1)] mam T[2(N — 1%) + h] 





1W—D Sy _ ay FER(N — 1) +4) 
(5.15) x Aker? Do = A) IT = DT 


ris(N —1) +>] TiN -1) +A 
MsN-1+ht+yr) ria(V—-1})) 

A considerable amount of cancellation will take place in (5.15), for m is greater 
than any k,. Suppose the largest k, is k,. Then we can cancel its product 
into the first one, with the assurance that there will be at least one factor 

r3(N — 1)] 

T3(N — 1) + Al 


to cancel the corresponding factor under the summation sign. Hence we have 


x 





(5.16) 


Pe LRN —1) +4) Pp Tp, _TBW — 91 
E we es 2 ; , , "” 2 
(on) ode T[3(N —7)] iat I T[3(N — 7) + hj 
—» F TIR(N —1) +) TRCN —1) +49] 
AMY 1) 5 A 2 : 2 
«A 2 =I) TOY = 1) + B+? 
where II’ indicates that t’ has been omitted, and II” indicates that one factor 
(5.16) has been cancelled. Then we can take out the factor 7 = m in the first 


product, putting it under the summation sign, where, together with the final 
factor in each term of the sum, it gives rise to the combination 


r[k(N — 1) +9] TERN — m) + AIPIA(m — 1) + >) 
T[3(N — m)]T'[3(m — 1) + »] TER(N — 1) +h +9] 
After making this reduction, we obtain 
ayy TW THIN-*) +A) fy, te, TRIN DI 
Boe) = TT I’ ga moe 
svn So gy PERN — 1) +9] BIA(N — m) +h, (mm — 1) +9) 
x wo 2 A) ina Dl BEG = m), Fm — 1) +] 


The products of ratios in the first part of (5.18) are of the type discussed by 
Wilks in connection with integral equations of type B[12]. It follows from his 
results that \7/” is distributed like the product 


(5.17) 








(5.18) 





2-0; = ++ Om: (m’ = m — ky — 1), 


where z and the 6’s are independently distributed, with the distribution of the 
@’s given by 


sel 1 T'(c;) bs— — 9.\ct~e-1 
fist) = Dera =e 8 
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where the b; and ¢; are constants which depend on N, m, and the sizes of the 
blocks, but not on A, and the distribution of z is given by 





y — 3(N—1) = 2 (N or 1) + y] rrr = gona 
F@) = 0 2 - a) Sip a SG SD 


Consequently, the probability that A lics between zero and X, is 


xw—p [ > ris(V — 1) +9] 
A;h) = a” a So 6 Sie 
J(A, do | y vIT[R(N — 1)] 
een a eo 
B[3(N — m), 3(m — 1) + 9] 
where the integral is to be extended over the region 


x f(@) dz dé, 


£0564... % <3", @6461, ©6251. 


Let us integrate first with respect to z and then with respect to the 6’s; we have 


po J en NE NE ELI OE ee 


saa 4(N—1) - o_ y The (N a 1) + v| 
suo J(A, me) = [. vem Ye vIT(RV — 1] 
x BW — m), 3m — 1) + v5 el seg) ag, 


B3(N — m), 3(m — 1) + 2] 
where S, is the set 110; < \2’" ,0 < 6; < 1, and 


? 
B’(u, v, g) = / 2” ‘(1 — 2)" ‘dz 
0 { 
(5.20) ; 


1 
= [ 2” (1 — z)" ‘dz = Biv, u, 1 — 9), 
1—¢ 


y(@) being the upper limit for z for fixed 6. It is clear that the subset of sg for E 
which ¢(@) < | will not be of measure zero in the 6-space, since we assume that | 
o<aA, < 1. 

The relation between (5.19) and the corresponding expression for the multiple 
correlation coefficient without fixed variates--the case 7 = 0 in (4.4)—may be 
clearer if we put 


(5.21) p=1-—A = Bi, B'" By, 


‘ 
I 


where B”” is the inverse of Bmm in || B || , and By’ is the inverse of By in || B, || . 
Then the required probability of rejection when A has any fixed value is 


_ 2) p xw—» TI(N — 1) + 2] 
1(3,1-% y= [ EFa- a) AGP 


B[2(m — 1) + », 2(N — m), 1 — ¢] 
B[z(m — 1) + », 2(N — m) 
where we have used the relation (5.20) between the incomplete Beta functions. 
Differentiating with respect to p before performing the integration with respect 


Xx f(@) dé, 











Po EN FE EE NPT MER IE 


¢ 
B 
' 
¥ 
e 





LIKILINOOD-NATIO TESTS FOR INDEPENDENCE 17 


to the é’s, we find by a computation similar to that in section 4 that each term 
in the series is positive except where ¢(@) = 1; so that we have 


al 
pp eb — de”) >0 (A. ¥ 1,0). 


And by (5.21), we then have 
ar 
Bim 


Since the argument is clearly independent of which B;,;, (u # v) we take, it 
tollows that the test is locally unbiased. We have therefore proved: 

TueorrM III. Jf 2’, --- , 2” have the joint normal distribution (5.1), then the 
likelihood ratio test for the hypothesis that the x’s are independent in sets is locally 
unbiased. 

In certain types of statistical material it may be important to consider, not 
the independence of the z’s themselves, but of their deviations from regression 
functions. For example, in the case of several related time series, it may be 
desirable to eliminate the trend of each x‘ by means of, say, a second degree 


polynomial in ¢. Consider then in general a population whose distribution func- 
tion is of the form 


> 0. 


1 
B @ Biile*—Cizm) (2i—Ciz”) 


dm (u,yv=m+1,---,m+qQ) 
T 


with unknown B;; and Ci. The likelihood ratio for testing the hypothesis H; 
that the sets of deviations 


1 1 m —i+t1 —)+l 
g ~ Cor, +++, 8° — Oey a OO Pe, oo Ft ~ Ee 


tj 4N 
yt tek, 
aes 


d® = X(ri, — Cire) (ai, — Ciz’, 


are independent is 


where 


and C’, is the usual least squares estimate of C;, , given by 


7 uy w 


Ai ge? 
Cia =a 


‘with 


a’ = 226%e (r,s =1,---,m+q). 

An examination of the characteristic function of the d” shows that their 
distribution law is the same as that of the v’’ of the preceding discussion, except 
for the fact that N — 1 is replaced by N — q. Consequently the above results 
on freedom from bias, and also those of the next section, apply equally well to 
the A, test for the independence of deviations from regression functions. 
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6. On thé moments of \7’". Although we have succeeded in proving the un- 
biased nature of the preceding test only in the local sense, we can show that the 
moments of the criterion \7/” have a property which seems very closely related to 
that of furnishing a completely unbiased test. For it can be shown that each 
of the quantities 


E{(a”"”)"] h = 3; 1, 13, o 
is greater, when H, is true than when any alternative H’ holds. It will perhaps 


be sufficient to prove this statement in detail for the case where h = 1 and 
where H, is the hypothesis that the matrix || B;; || has the form || Bo || + ||B:,;,|| : 


| Bu By 
0 
Bu Br 
Bss Bas 
Bs Bu 
0 0 | Bii3 || 


in the notation of the preceding section we then have 
4, ji = 1, 2,; te ,j2 = 3,4; 73 , Js = 5, --+ , mM. 
Even when H is not true we find that 


ee G(B,N—1,m) G(B,N —1+2h,m-— 4) 
6.1) Ello "|v |"] = ’ ’ ; _ , 
— oe 3 G(B,N—1+2h,m) G(B,N —1,m-— 4) 
where B**** = B***, Using the definition of the G’s in section 5 and the Jacobi 
theorem, we can write (6.1) in the form 


E{| v4 |* | o** |") = K,B* 





where B is the determinant of the matrix composed of the first four rows and 
columns of || B;; ||. In the general case we therefore have 


Be Me Bn Bu 
Ba Be Be Mu 
Bu Bu Bs Bull 
RM, Ma Be Ba 


|B || = 


Thus if we set h = 1, and replace B;,;, and B;,;, by Bi,;, + EE? + EMPEY? 
4 1 
and Bij, + €2&> + ePe respectively, indicating this replacement by a 


prime, we obtain 


(6.2) E{(v?’*)'] - z / Biv) Bre) Bo df. 





A ORIOLE MEET SEE 





<tr ee ew? 


A OE RE MEER ‘ 


RE IT I ot 





LIKELIHOOD-RATIO TESTS FOR INDEPENDENCE 19 


Treating B’ as a bordered determinant, we can reduce it to 
B’ = Baw (1 + Bandi es? 

= Ban(1 + Bases en 1 + Barge 

= Ba(l + Baseper yd + Basepe?y(l + Bawere? 

= BA + BEEN + BaSEren yd. + Baseren ) + Bushee’), 
where the subscripts on the B’s indicate the sets of ¢’s still contained in the 
determinants, and || B” || = || B,;||\"’. Similarly, 
(6.4) B’ 

= Ba + Beers? +. Bayarer yd. + Bagee? yd + Buse? ey), 
the inverse now being taken with respect to || B |. 

But between, say, Bus} and Bis , there is the relation 


where || Bas), ;, || = || Bi2} ||", that is, the inverse of the matrix obtained by 
deleting the first four rows. and columns of || Biiz) ||. Consequently 

#272,(3) ' Bi? (3) { 

(12)E¢ Bii25kis 


with equality holding only for those values of the ¢’s for which 
e Bus = 0 iz = 5, ---,m. 


And this set of ¢’s will not make up the entire ~ space unless || B;; || = 


|| B || 4+ || B;,;, ||. Applying the same kind of reasoning to the other quad- 
ratic forms in (6.4), we can therefore show that 


| BiMN-D Bre-) Bo dé 


<b fa + Begs or .. -(1 + Bilis, ef (4) "t') Meaahied 3 


The last form can be reduced to a sum of squares with unit coefficients bya 
linear transformation of the ¢“”’s; thus 


[a Brite) Br dt 
(6.6) 
<b / | Bil}, ad + Big) ye... : -(1 +4. rE + ies dé. 
And by making use of the fact that 
Bait) = Bass-| Bax isis |, 
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we can express the right-hand side of (6.6) as 

Bo +f Baan +| Boomers, ta + meee i @ + eee dé. 
This in turn becomes [c. f. (6.4)] 


Bf | Bossi 1 + Bg ee) + Bae ey 
x (1 + Biiize! ~ ef)- ad G + ref ef?)- k(N+1) dé 


= [| Baresi: [10 + BME? ey + Bist HO 
= (4) -i(N 
K (1+ Ee) + zeit 9g) VAD ge 
At this stage we can write 


| Bayigi, | = | Bay | + BY*MEsPEP yl + Bay EPe?), 
where || BG? || = || Bayi,;, ||’, and apply the relation 
Bas” = Bay! — BG}BoainBa", — || Barisis || = || BS? 7 


Therefore, 


Bei g® gOS Birir g@ g@ 
unless £{7 Bij}? = 0 (é2 = 3, 4). We can thus continue as follows 


at hee ie dé 


< | Biz, | | (1 + BPs g™ ey MND 4 + at eye 
x (1 + se ef) + xi ar dé 


(2) 


Transforming the £*’’s, we get 


| Bis, js / Boy? r 41 + B*s141 gf) gO rma + DE? yr 
x (1+ BE + Beles) tr ade. 


Since | Bo fiat [* = | Bayz; |, this becomes 
| Bi rfc (1+ prsshs gO 9) - IN (y ure 
4171 1 t1 
x (1 + rE? e)- ancy + ze? a) 4(N+1) dt 


=-fa + pee Py a + se? e —k(N+1) 


x (1+ zEy? )- NY] + zzi° eyo dé. 





og RRM R EIT” 


rr 
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Collecting these results, we finally obtain 


gi“ Ro dé 


<K | (1 + De ey + SPE) 
«K (1+ EP ef) 1 + rele eo) ak 


with equality only in case H;, is true. But the right side of (6.7) is the first 
moment of Aj7/™ computed under the hypothesis H; , while the left side gives 
the corresponding moment in the general case. 

The possibility of carrying out this reduction for the case in which the matrix 
|| B || has more than two blocks, or blocks of unequal size, seems sufficiently 
clear. And to obtain higher moments, we have only to introduce the proper 
number of é’s into each set. We then have: 

THEOREM IIIa. Let X; be the likelihood ratio appropriate to testing the hypothesis 
H, that the normally distributed variates x’, --- , x” fall into the mutually inde- 
pendent sets x',---,2™';--- 32" ...,2”". Then the expected value of 


ory. h = 3,1, 13, --- , is greater under the null hypothesis H, than under any 
alternative hypothesis in Q. 





7. The general regression problem. Let the variates 2’, ---,2' be: dis- 
tributed according to the law 
(7 1) | Bi 1 @ Bii (2*—Cha—C ga") (zi—c$2"—cizr) 


qt 


Throughout this section, let the ranges of the indices be 


7,j=1,---,t P,Qd=t+1,---,m 
r,s=il,---,m r,s =1,---,t+¢ 
wpvy=t+1,---,i+q oor =t+qrtl,---,m. 


In (7.1) we therefore have ¢t random variates, and m — t fixed variates. Con- 


sider the hypothesis H that the x’ are independent of the last set of x’s, namely 
x. We have 


2: || Bj; || positive definite, —0 < Ci, < o, 
while for w we impose the additional requirement 
C, = 0. 


Thus in general we have for the distribution of random samples Ov , N > m, 


N 
(7.2) | Big PY By Bas eb cheB) ahh 
, Nt 
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while when H is true, we have 


N 
a” | Bi ‘aad - 1 By j(z§—Chzk) (zi—ciz%) 


(7.3) P= 





Differentiating (7.2) with respect to the B’s and C’s and setting the derivatives 
equal to zero gives us the conditions 


N N 
(7.4) z Ciatst = > zizt, 
a=l1 a=] 
ee 2 oe ; : ‘ 
(7.5) Bu=5 & (xi, — Ci.x?)(2i, — Cizt). 


As in section 2, we put 
N 
a" = > air. 
a=1 
and assume that the fixed values x2 have been so chosen that || a” || is positive 
definite. Then (7.4) and (7.5) can be combined to give 


BY = 2 (a — arated) = 2a! 


where || a> ||’ = || a" ||. It ‘then follows that 
aia (NV 
Po = lar (2) "em, 
Similarly, 


int 
_ | sti |-4N Nv —4N 
P. | a | (7) e ’ 
where 
ai! = a? — aaa”, |I ay || = [a I. 
The matrix || a” || will be positive definite except for a set of probability zero, 
so that we can consider || @’’ || as the inverse of the matrix obtained by removing 
the last m — t rows and columns of the inverse of || a” ||, and || a’ || as the 
inverse of the matrix obtained by removing the last q rows and columns of 
\|a”’*’ ||". Then by the Jacobi theorem 
oo Pa | ee 
ai = |e lait = 1S 
lar’ [a 








so that the appropriate likelihood ratio is given by 


pIN la” | |a”” | 
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It will be advantageous to complete the matrix || B;; || in (7.1) by defining 
Bip = —B;;C}, 
By, = CZBi;C}. 

(Evidently B;, = 0 for i = 1, ---,¢ and fixed p, if and only if C4, = 0, 


(7.6) 





j =1,---,¢). We can now write (7.2) as 

| " > i4.pih i+ pik 

Bi; vie on Byj(zi+Bt Bhpt®) (z2+Bi Bkgz2) 
(7.7) P(z, B) = ~iNt 


We next notice that \ is invariant under the transformations 
x’ > ajx’, x’ — Bir’, 
so that if we put 


I(B, i.) = [ Pe, B) dz} --- dry, 
8s 


where the integral is extended over the region 


BS: @sSsasiX, 
it turns out that 


I(B, \.) = I(B*, d.), 


provided 


* k l * k * k T 
Bj; = a; Bra; , Bi, = ai Bip, Bi. = a; Bir Bo. 


To prove the locally unbiased character of the test, we may therefore consider 
the derivatives 


- I(Be, .); speape 1B. r.), 
and assume that || Bj; || and || a” || are in diagonal form. We also observe 
that \ is unaltered by the transformation 

zt’ — a + BYB,, 2". 
We therefore have 


N 
* a . ‘ ; ; 
| Bi; ia [ ant BY j (zgt+B** Bh ,zg) (z,+B*i! By ,27) 
8 


aint 


I(B*, Ae) ar 


z. 


Thus, 








N 

# ” ~ = Bt; zizi 

a I(Bo, .) = —2 Sed [ dL, tarec — 
ko oa 
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which is easily seen to be zero. Again, consider a non-tepeated second partial 
derivative, say 
2 
a ¥ 
—,——; 1( Bo, ») 
* * oo 
OBiw dBi, 





N 
¥ iv * N - & * wtat 
B;; . bk? ¢ > k - om Pritata 
= —2 ei BY a — 2 : rr. >, t3tg}e “' dix. 
= 8 a=i B=i 
This piainly vanishes if k ~ 1; but it is by no means easy to see what happens 
when k = 1, even when o ~ 7. Let us therefore study the distribution law of 
2 
\”/* for the case, 
Bis = Q, 1 # i. 


(We shall not, however, assume that the transformation B — B* has been 
made on the B’s.) 


Define 
Bu = By — B,:B" Bi, 
i” = a7 — a%a,,0", 
where || a,, || now stands for the inverse of || a“ ||. These expressions will 


arise when we adapt Wilks’ method of moment generating operators [13], based 
on the identity 





(7.8) [ cr" act .oe dry = 2°™*B exp (—B,,a") ; 
to the problem. We shall understand from now on that B = | B;;| and | 
\| BY’ || = || By; ||". Let us rearrange the form in the exponential on the f 
right, thus: 


B,,a"" = (B,a” + 2B,.0" + B,,a” — 2B,:B’B ja" 
— B,,B"B,,a"a,,a") — BiB’ Ba" 
= Q — B,:B"B;a" 
= Q — By;;. 
A subscript 6 will denote the result of replacing B,s by Brs + Bys, and a ) 
prime will indicate that each 6,-.. has been replaced by 6,5 + &€s. Consider 


now the result of integrating the right hand side of (7.8) after these replace- 
ments have been made: 


ETE ITO 


4Nt 


” [a exp (—Byqsa”) dé --- dé: dfs +++ dé erg 


= 7M! / By ties ( / o*b dey) dé; , 


(7.9) 
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Let us integrate first with respect to the &. Wilks has shown how to write 
Q; in the form 


/ B v 1 v 
Qs = —Qis + Byosa” + Bi a” t,£, — 2Byig Bz? a” &5&,, 
where 
Qis = Byis Bs’ Bysa” + 2B,i3 Bs” Bjea” + B,:Bs” Bi, a” aya’. 


This latter expression is thus free of the &. Consequently, 
_ Bs\"* ba ; 
feta = (BP erancmernone 
8 


/ 


Bs i y 
Qos = B, ww (a"” Bris B;' "t)(a * Bors Ba” £1), 


where 


which can be written 
Bs Nij kl uy 4ij kl uo 
B; {Byis Bs’ Bug Bg’ &;§0" + 2Byip Bs” BorBs §j€0 


+ Bai By” By B;" &;£,a™ 7 | ‘ 


The method of reduction used by Wilks can now be applied to Qig and Qe , 
and gives 


Qis + Qos = ByisBs’Bjga” + 2Byis By’ Bjoa”” + Bi By’ Bia aya", 
an expression which does not involve the ¢’s. Thus 
(7.10) [e% dé, = a | a” \? Bz** e**. Bs’. 
Now the quantity 


/ 
ea isi — >> yiBs") (ysi = a 


v=m( 


? 


where B” stands for the cofactor of Bj; in || B;; ||, can be expressed in terms 
of Bs , provided we use our assumption that Bi, = 0,7 ¥ 1, whereupon yi;Bp” 


reduces to the single term yBs". In fact, we have 


t ’ 
Elgs| a" |"] = K IT WN —m+t+1—i, 2h) |a™ |ha'”* Beer 
(7.11) ” 


B ey -to —op So (yBs')” p—tacw—a)+a+0) 
X exp (— By ga”) = KI yea t®*| 2 |* Bode eee 7 i Bz Broth 
_ ¥ { 
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where, following the notation used by Wilks [13], 
= ete, R= BM exp (—Byea, 


_ Ti+ d)] 
v(a, b) = ~ Tia] : 
And (7.11) can be written as 


Elga|a™ |"] = Kr'™* TT y-| a” |" Bs**e"% 
t=1 
(7.12) . 
y T[s(N —q) +A] ' peony, 
rao VI T[R(N — g) +h + >] = 


where B, stands for the result of replacing By by By — u. Changing B,-. into 
Bes + &€ and integrating, we then find that by virtue of (7.10) 














Elge\a™ |"|a"*’ | 7] = Kxi™* Iv. a” |*| a" |* Byte % 
_— f [3(N — q) +h] 
au Ty THN-gt+hl a a 1—1hw—@) +h] |. 
_ »=0 v! I[3(N — g) +h + v] ow’ Bou a 0 
Now 


—-- dé; - ee we” Il Y(N —4q + 2h +1-— i, —1), 
i=1 
so that (7.13) becomes 


Elga|a™ |" |a"’*’ |] = Ra! TT yw —m+t+1—i, 2h) 


(7.14) x Ty - 9+ 2h +1-4,-Dla™ Pa" 





—40 9 y’ TH(N-— gq) +A) 2” 
X-Bp ee Ore 


Comparing (7.14) with (7.12), and making use of the fact that 


a em. 





¥(a, —1)y(1 - 1, ~ 1) 7+. y(a — 2h +> 1, i) = v(a, —2h), 


we thus have 


t 
Elgs|a" |*|a"*’ {"] = Ka'™* TT y(N — m+t +1 — i, 2h) 
; t=1 


t 
x TT Win — ¢ + 2h +1 — i, —2h)| a |"| a” |" Bsc % 
t=] 


—y THIN-g +h] @ 


(Bz. 40-0) - 
v=o vIT[R(N — g) Hh+>r] aw © 


t 
£ 
f 
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Setting the 6’s equal to zero, performing the differentiation, and recalling the 
definitions of K and Qs , we then find 


El(x"")"] = Ty —m+t+1—i, 2h) yw —q+2h+1—i, —2h) 

7.15 
ie x eat > (yB")’ Tik(N-— gq +A) TRIN -—g) +24) 
= vl TR(N—gtht+>] TR(N-—@®)) - 


Taking the first factor from each product, we can convert (7.15) into 


[yey — m+e-+1~ 4,28) Tl yiw — 9 + 2h +1 - 4, ~28) 


x ee = (yB")’ T[A(N — m+ t) + h) THW-®) +2] 
=o! r3(N —m+t)] TRN-—g) +h+ 
This last product of ratios of I'’s is equivalent to 
ri3(N — 9) +4] Tha(m — t— 9) +l — m+t) +h) 
T[3(N — m + t)|T[¥(m —t—q) +] © r[3(N — q) +h +>] 


2! N 








Thus the moments of are connected with an integral equation of type B 
[12] and \”” is distributed like the product 


sens & 0<2<1,0<6<1, 
where the joint distribution of the 6’s is 
(0) , 
= II r[3(N = @ + 1— 1)] Gene ete a gerne 


me TW —m+ti+1—)Ta™m—-t—@) * 
and z is distributed independently of the @’s with the distribution 


os agg oo (yB")’ 
Cm Me Er ae aeeei- ete 


The probability that 0 < A < X, is therefore 


pes, oe i 





Iy,>) = [ f(0)F(z) dzd®, +++ dB:, 


where S is the region 0 < @.--- 0,2 < 2" Putting ¢(@) for the upper limit 
of z in S for fixed 9, and S, for the projection of S into the @ space, we then have 


~spit (yB")’ gi(N—mte— ‘Ch z)im-t-o)— l+p i} 
1y,») = [ S0)4e ye Paw oer ooo epee 


If we replace z by (1 — z) we then find 
7 ™ 
wr) = [ 





(7.17) 





_yaur & (yB")’ Bif(m — t — g) +, 3(N —m+;1—- a 
st 2» v! B[4(m — t — g) + », 3(N — m+ d)) - 
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As far as y is concerned, (7.17) is essentially the same as (2.8). The computa- 
tion which was made there, together with the type of reasoning employed in 
the latter part of section 5 in connection with the independence test for several 
blocks, then shows that 


si rn) >0 (0 <e<)). 


Remembering that 
, > a’ BaBu ’ 


2 
( . .) ” 0, ~ : = 2a”, 
aBa 0 dBa aBn 


and we remark that the assumed positive definiteness of || a” || implies that of 
|| a” ||. Hence the relation 


a° al 
(som: dBi Ty, . 0), = (2), « 


together with the fact that we could have obtained the analogue of (7.17) 
under the assumption 


we see that 


B, = 90 1~W, 


where 7% is any fixed number in the set 1, --- , ¢, shows that the matrix of 
second partial derivatives is positive definite when H is true. 

Thus we have 

TureoreM IV. Let x’, --- , 2° be normally distributed about means which are 
linear functions of certain Sool variates x‘**, ... , 2”. Then the likelihood ratio 
test for the hypothesis that the distribution of x’, --- , x depends only on a selected 
subset x**?, ... , 2°** of the fixed variates is lesuilly othe. 

The cunt of this section has its most immediate application to those problems 


in the analysis of variance which require simultaneous consideration of several 


interrelated dependent variables z, ... , 2° in conjunction with a given set of 
independent variables a")... ,2™[15]. For the usual hypothesis to be tested 
in this case is that x’, --- , x‘ are jointly independent of, say, 2‘***?, ... , 2”. 


To return to the general case of (7.1), the method of this section can also be 
used to test the hypothesis that the regression coefficients referring to the x’ 
have particular values, say 


ci = Ci, Cw Ess 


,t;0 =t+qt1,---,m, 


the remaining C’s and the B’s being left unspecified. Since we have 








a’ — Ciz” — Cia? = 2° — Cir" — (Ci — Ci)x? — Ciz’, 





t 
t 
; 
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by the device of replacing zs by x, — Chor’ , we can reduce this problem to 
that of testing the hypothesis that 


ci =Ci- ch = 0. 


Similarly, the problem of testing whether the linear functions ui = a3C! have 
specified values uso comes under the same heading [7]. 

A particularly interesting case of the general regression problem is that in 
which m = t + q + 1, so that the null hypothesis H states that the chance 
variables x* are independent of the fixed variate x”, though they may depend 
upon z‘*’,---,2™ '. In this case we are able to find the exact distribution 
law of \”” without assuming that any of the regression coefficients C’ are zero. 
For the quantity 


00 v - b ‘ 
(7.18) , a )” Botr-o4nen 


v=0 


which would have occurred in (7.11) had it not been for the restriction B;, = 0 
(¢ ~ 1), can now be expressed in terms of Bg even without this restriction. 
By definition 


yi;B" = a"" B"B. Bu; 


and the vanishing of the Bn; is equivalent to the vanishing of the regression 
coefficients C;, associated with x”. And since 


| Bi; _ ua” BmiBm; | = B- ua”” BY Ba:Buj , 
we can wriie (7.18) in the form 


1 TRV -g@t+A) @” 


So eae ee [Bp ite-or*)) i 
v=0 v! r[3(N _ q) + h a v] ou” Bu u=0 > 





where 
|| Bau || = || Bijs — ua” BmiBm; || 


is positive definite provided u is sufficiently small. Thus the moments of y 
can be found from (7.15) if we put a”"B’BniBmn; = yi;B’ in place of yB". 
Moreover, it can be seen that when the value m = t + q + 1 is substituted 
into (7.15), that expression reduces to 
—y,;Bii SS (yigB’)” BIZ(N — m+ 1) +h, 3m — q — 1) +9] 
El(x2*)*) = e7¥ei8 (yas P) OW ’ 
T= eS A ME es ee -OFa 


so that \”’” is distributed like w, where 








_ -visnti > (Ys BY)” gets nn Gree 
(7.19) fw) =e ) i ai -ssiw ee ese 
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The distribution law of \”’” for this case is thus closely related to that obtained 
in the treatment of the regression problem with one dependent variate in 
section 2. Applying the argument used there, we can obtain: 

THEOREM [Va. The likelihood ratio test for the hypothesis that in a population 
of the type (7.1) the variates x’ are independent of x"—the caseem =t+q+1 
of Theorem IV—is completely unbiased. 

If we specialize the problem somewhat further, considering the case q = 0, 
za = 1 (so that m = t + 1), we find that the likelihood ratic takes the form 


wv 1 1 


1+ Noy #147’ 


N 
where v®? = >, (zi, — 2')(21, — 2’), and T is Hotelling’s generalization [5] 
a=] 
of Student’s ratio. In this case we are testing the hypothesis that the x* are 
distributed with zero means. The exact distribution law of 
1— )7/% 


7 A2/N 


was recently published by P. L. Hsu [6], who obtained it in a very elegant 
fashion by means of the Laplace transforni. He has also shown that the re- 
sulting test is most powerful in the sense that, of all critical regions S for which 


P{x Cc S} = € + }aBb:b; + R(b) 


(where ¢ and a are independent of the B’’ and of the means b;, and R is an 
infinitesimal of at least the third order as all b; tend to zero), the critical region 
defined by 


Ss: Fas. 


has the largest possible value of a. Tang’s tables [11] make it evident that 
this largest possible value of a is actually positive and that the test is in fact 
unbiased for all values of the b’s when e = .05 or e = .01. The results of this 
section may be used to show that this property extends to all probability levels 
other than e = 0 and e = 1. 

The application of Hotelling’s 7 is by no means confined to the above case. 
Other hypotheses which can be tested by means of this statistic are discussed 
by Hsu [6]. In addition it is now known that the Studentized D’, devised 
by Mahalanobis for measuring the “distance” between two normal multi- 
variate populations, is proportional to Hotelling’s T. This fact is pointed out 
by R. C. Bose and N. Roy [1], who have obtained the exact distribution of D’ 
for the case in which the two populations from which the samples are drawn 
are assumed to have the same matrix of variances and covariances, but are 
allowed to have different sets of means; their work, however, is quite independent 
of Hsu’s. They also note that D’ is proportional to the ratio which arises in 
Fisher’s method of multiple measurements [4]. 
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8. Summary. The method of likelihood-ratios is of practical as well as theo- 
retical importance, because it provides a unified approach to the problem of 
testing statistical hypotheses. In this paper we have investigated many of the 
tests which this method yields when applied to hypotheses about sets of re- 
gression coefficients and covariances in normal populations. By studying the 
probability functions of the corresponding )-criteria we are able to show that 
these tests are ‘‘good,”’ in the sense that they are unbiased even for small samples. 

Among the completely unbiased tests which can be based on the likelihood- 
ratio method, our discussion includes: the multiple correlation coefficient, with 
or without fixed variates [13]; Hotelling’s generalized 7 test [6] and the sta- 
tistically equivalent ‘“‘Studentized D*’ [1]; the ordinary analysis of variance 
and covariance for orthogonal or non-orthogonal data [11, 16], as well as related 
tests of linear hypotheses in the case of one chance variable. 

With respect to the analysis of variance for two or more variables [15] and 
certain other hypotheses regarding regression coefficients in multivariate popu- 
lations, though there are indications that the tests are completely unbiased, we 
have succeeded in demonstrating this property only in the local sense. 

Finally, the likelihood-ratio test for the hypothesis that the variates fall into 
certain specified mutually independent sets [14] is shown to be unbiased, at 
least locally, and has the additional property described in Theorem IIIa. 

In conclusion, much more than a word of acknowledgment is due to Professor 
S. S. Wilks of Princeton University, to whom the writer is greatly indebted for 
advice and encouragement. 
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STATISTICAL SEMINVARIANTS AND THEIR SETIMATES WITH 
PARTICULAR EMPHASIS ON THEIR RELATION TO 
ALGEBRAIC INVARIANTS 
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INTRODUCTION 


An important portion of algebraic invariant theory has been that devoted to a 
certain class of invariants called seminvariants, semi-invariants, or more rarely, 
half-invariants. Of these terms, “‘seminvariant’’ seems to be the one now 
commonly accepted. The same three terms have been applied at various times 
and by various writers to a system of moment functions of importance in sta- 
tistical theory. The statistician using these terms has frequently done so with 
an apology for appropriating’ a term of the algebraist. As a portion of this 
paper we shall show that the moment functions of this system are actually 
algebraic seminvariants, and that there are other systems of moment functions 
which are equally entitled to the name seminvariant. 
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The study of the statistical seminvariants of a population leads naturally to 
consideration of the problem of obtaining from a sample unbiased estimates of 
the value of these seminvariants. Estimates of this kind have been defined 
and computed by previous authors, but no simple method of obtaining the 
estimates has been given. In this paper a simple procedure for calculation is 
given and it is furthermore demonstrated that these estimates form an important 
phase of statistical seminvariant theory. 

The system of notation used for moment functions is that of R. A. Fisher, 
although the actual letters used in representing particular moment functions are 
not altogether the same as those used by Fisher. In general, a moment function 
of the population has been indicated by a Greek letter, the corresponding sample 
moment function by the corresponding English letter and the estimate by the 
corresponding capital English letter. 

A list of references appears at the end of the paper. Each reference has been 
assigned a number and this number placed in square brackets is used in the body 
of the paper to indicate the reference. Pages of the reference are indicated by 
additional numbers inserted in the parentheses and separated from the reference 
number by a semicolon. 


I. THE RELATION OF THE ALGEBRAIC SEMINVARIANT THEORY TO THE MOMENT 
FUNCTIONS OF STATISTICS 


. — . —E 
TE TL I ee Ey 9 


The purposes of this chapter are: (1) to review briefly and give adequate 
references to certain important phases of algebraic seminvariant theory, (2) to 
apply this material to the moment functions of statistics. 

1. Definitions. Any function of the coefficients of the binary form 


(1) f=> (") aX""¥', ay #0, 


‘=o \t 


aE arn 


Ce TERT EN SEITE 


which is invariant under the transformation 


v1 Ya 
61 de 


(2) X = yf + yen, Y = b:& + d07, A= ~ 0, 








is called an invariant of the form f. See Dickson [1; 31-36]. 
Any function of the coefficients of f which is invariant under the trans- 
formation 


(3) X=é&+/Yn, 


is called a seminvariant of f. 
The two operators 


Y = 9, 











» 0 = ' 0 
(4) 2= 2 iai—, O=LV(n-i+ Da —, 

i=1 da; i=1 da;-1 
are of fundamental importance in the theory of algebraic invariants and semin- 
variants and, indeed, invariants and seminvariants may be defined by means 
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of these operators. A necessary and sufficient condition that an homogeneous 
isobaric function of the coefficients of f be an invariant is that it be annihilated 
by both 2 and O. See Elliott (2; 113, 124]. The necessary and sufficient 
condition that an homogeneous isobaric function of the coefficients of f be a 
seminvariant is that it be annihilated by 2. See Elliott [2; 127]. 

It should be noted that there is nothing in the definitions above which requires 
that invariants or seminvariants be integral, although usually only this type is 
discussed. In what follows we shall find it more profitable to discuss homoge- 
neous isobaric fractional seminvariants, the fractional quality resulting from 
the appearance of a in the denominator. 


2. Complete Systems of Seminvariants. By direct application of the trans- 
formation (3) to f the system of seminvariants [1; 47] 


(5) A, = y (;)(-2) —, r 4 n, 
i=0 \2 ao ao 


is obtained. This system is a complete system, (2; 44, 205, 206], in the sense 
that all other seminvariants fractional in a and of degree 0 are expressible 
rationally and integrally in terms of this system. 

Other such systems can be defined. The system of minimum degree semin- 
variants, the seminvariants of even weight being of degree 2 and those of odd 
weight being of degree 3, has played an important role in the algebraic seminvari- 
ant theory. Elliott [2; 207-209] discusses this system and gives the general 
formula for the even weight seminvariants of the system. So far as the present 
writer has been able to discover the general formula for the odd weight semin- 
variants has never been published, although Hammond [3] may have obtained it. 
After some lengthy but not difficult computation the result has been obtained, 
so that the last mentioned system of seminvariants is completely defined by 


on’ . i (2r\ ai deri 
Cx = 5d (-1) yo. 


a F a i+r 2r 2 +- 1 Ori Or+i+1 
(6) Coss = 2 ( 1) (7 Joti 


2r 
+ > (—3)** ) Sages 
s=0 a 


+ 0 











It is easily demonstrated that for each of the above seminvariants, and in 
fact for any seminvariant, the sum of the numerical coefficients is zero. Dickson 
[1; 55] gives a suggestion leading to a very simple proof. 

3. The MacMahon Non-Unitary Symmetric Function Principle. Denoting 
the roots of >> (") a;X"* = 0 by a, a, +--+ an, the 7-th power sum of 

t=0 
these roots is defined by 
(7) &s = > ay . 


t=1 




















36 PAUL L. DRESSEL 


The form f may be written II (X — a). 
t=1 


By a result due to MacMahon [4; 131] the seminvariants of the form f are 
identical, except for numerical factors, with those symmetric functions of the 
roots of 


(8) g= Da Xx"* = 0 
=o i! 


which when expressed in terms of sums of powers of these roots do not con- 
tain s,. MacMahon called such symmetric functions “non-unitary.” 
As a result of this theorem, MacMahon was able to discuss the seminvariants 
of a binary form of infinite order by discussing the non-unitary symmetric 
00 


functions of the roots of , 7 Y'=0. 


t=—0 0. 


4. A Third Complete System of Seminvariants. By application of the result 
stated in the previous section, a third complete system of seminvariants can be 
immediately obtained. Obviously the power sums s,,r > 1, are independent 
of s,. By the Waring formula, Burnside and Panton [5; 91-92], if 


do ¥* = w IT (1 - oY) 


then 


_ 5 (—1)’r(p — 1)! (ar\" (e2\” 
(9) —* ea (2) (:) 


wherein 


3)", 


n n 
p= Lm, r= >in. 
i= 


t=1 















g9= ax, 


i=0 2 


woven en Ma) GY =)" 


: ay! 1! eee Tn! (2!)"? eee (n!)™ 





Placing B, = —(r — 1)!s, the B’s form a complete system of seminvariants. 
This result has some interesting statistical connections which will be men- 
tioned later. 





5. Linearly Independent Seminvariants. It follows from the MacMahon non- 
unitary symmetric function principle, or it can be proved easily in other ways, 
that the number of linearly independent seminvariants of a given weight r is 
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equal to the number of partitions of r which contain no unit part. Furthermore 
we have at our disposal a simple method for obtaining a set of linearly inde- 
pendent seminvariants of any given weight. 

For many purposes the power product defined by Dwyer [6; 13] is more 
useful than the customary monomial symmetric function. The power product 
is defined by the right hand member and indicated by the left hand member of 


(11) (q1 +++ Gr) = > = aftaf? ... aft, 

i Aig: * + xi, 
where, for convenience, gq; > g2 > --- > qr. The monomial symmetric func- 
tion which will be denoted by M(q --- q,) is related to the power product by 
the identity 


(12) m!--- mIM(qi'g2” --- ge’) = (qi'q2’ --- ge’), 


so that a distinction occurs only when there are repeated exponents in the 
summation of (11). 

If we desire a system of linearly independent seminvariants of weight 6, by 
the MacMahon principle we need only to compute the values of the power 
products (6), (42), (33), (222) in terms of the a’s. In a somewhat different 
form these will be presented later. 


6. The Roberts Theorem. Roberts, see [2; 231] and [5; 108], demonstrated 
the existence of a duality relationship between power sums, s’s, and coefficients, 
a’s such that corresponding to any seminvariant in terms of a’s there exists 
a seminvariant in terms of s’s obtained by replacing a; by s;. The proof con- 
sists of showing that the annihilator for seminvariants in terms of power sums 
is identical in form with Q, a; being replaced by s; . 

As a result of this duality, each of the systems of seminvariants which have 
been obtained yields, upon replacement of a; bys; , another system of semin- 
variants. In particular cases it may happen that the systems are identical 
when the identities connecting the a; and s; are taken into consideration. 

We next wish to show that the systems of power sum seminvariants thus 
obtained either are identical with certain well known statistical moment func- 
tions or lead to new ones. 


7. Statistical Distributions Represented by Binary Forms. The fact that 
statistical distributions may be represented by polynomials has long been 
recognized by statisticians, see Thiele [7; 24-26] and Bertilsen [8]. Indeed it 
was this fact which led Thiele to the definition of the seminvariants now called 
by his name. If we have given n observations a , a2, --- an, form the poly- 
nomial. 


(13) F= T(x - a) = & (")4xr 


1} ao 
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F is not a binary form, but the seminvariant theory of binary forms is applicable 
since seminvariants are functions of the differences of the roots and are inde- 
pendent of the X and Y, which appear merely as convenient symbols to indicate 
the various terms of the algebraic form. 

For distributions containing an infinite number of items the form F is of 
infinite order, but discussion of its seminvariants may be carried on by use of 
the MacMahon principle given in section 3. 


8. Three Systems of Statistical Seminvariants. Before exhibiting some sys- 
tems of statistical seminvariants it may be well to consider the meaning of 
“statistical seminvariant,’’ for this phrase has been undefined. In fact the use 
of the phrase is merely a matter of convenience in that it emphasizes the fact 
that seminvariant moment functions have not previously been regarded as 
algebraic seminvariants. As used here a statistical seminvariant is an algebraic 
seminvariant which has some application in statistical theory. 

The system of seminvariants (5) yields by application of the Roberts’ Theorem 
the well known — of statistical seminvariants usually called central mo- 


ments. If yu, = ~ = —, the general formula may be written 
So’ 


(14) m= (5) alan 


t=0 


The system of seminvariants (6) likewise leads to 


=i 7 (= “(? "Yale, 


2 i=0 


oni - __1)ttr 2r 22 +1 8. 
(15) am = Xe (-"(, 7") ere etn 


+ > (—1)" ) MiMi Mar, 


t=0 


a system which seems never to have been used by statisticians. 
The system (10) leads to the well known Thiele seminvariants 


3 (- 1)? *r!(o — 1)!(wi)"*(us)™? ~~~ (ue) 
(16) hr ae" ~ pony _ QD"... GD" Gb? 


From sections 3 and 4 it is apparent that the general formula for the Thiele 
seminvariants is a special case of the Waring formula for power sums in terms 
of coefficients. It does not seem that this fact has been previously recognized. 
An equivalent way of stating this idea is to say that the Thiele seminvariant 
d, is, except for the factor —(r — 1)!, the sum of the r-th powers of the roots of 
the equations obtained by setting the moment generating function, 


° ‘ y' 
M.(Y) = ing Bi? 


equal to zero. 
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It is of historical interest to note that MacMahon published his non-unitary 
function principle and the resulting set of seminvariants in 1884. Cayley [8] 
published an article in 1885 dealing with this same system. Roberts’ Theorem 
having been known for some time (probably about 20 years), it seems probable 
that MacMahon and Cayley were aware of the Thiele seminvariants four to 
five years before Thiele’s definition [9] by an entirely different method. 


9. Linearly Independent Statistical Seminvariants. At the end of section 5 
a method was indicated whereby a complete set of linearly independent semin- 
variants of a given weight r could be obtained. It has been noted previously 
that the one part symmetric function s, or (r) leads to the Thiele seminvariant \, . 
As a further illustration consider the power product (22). From a table of 
symmetric functions we find that 


(22) = 2a, po 20341 az 
4lao 38!ap = 2! 2tas 
Q, 403d =) 
a a)’ 
and by the Roberts’ Theorem the statistical seminvariant 


Ju — 4usui + 32”) 

is obtained. In similar fashion a system of linearly independent seminvariants 
of weight < 8 have been computed and are given in Table I. For the sake of 
brevity they are expressed in terms of central moments. Hence the degree, by 
which is meant the maximum degree in the y’’s, is not apparent in the table. 
This definition of degree associates with the statistical seminvariant the degree 
(in the usual sense) of the corresponding homogeneous integral seminvariant. 


10. Statistical Invariants. If the transformation 
(17) r=é§+ mkn, y= mn 
is applied to the binary form f and, if, in particular 

—_ a 


Ao 
one system of invariants of f under this transformation is found to be 
(18) D,=A,/A¥, rn, 


where A, is defined in (5). By the Roberts Theorem we obtain the fact that 
the standard moment p/p” is an invariant of f under this transformation. 
Thus the standard moments, or standard seminvariants in general, have also 
an algebraic connection. The effect of the transformation (17) on the roots of f 
is indicated by 


x— ay = &§ + mkn — man = § — mas — k)n. 
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If m and k are defined as above, the result is the equivalent of measuring in 
, 


standard units denoted by cng 


V ue 

The system (18) is not a system of algebraic invariants, for algebraic invariants 
must be invariant under rotation, translation and change of scale, or stretching. 
The component parts of the above system are invariant only under the last two 


TABLE I 
Linearly Independent Seminvarianis of Weight S 8 
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types of transformation. In statistics translation and change of scale ordinarily 
constitute the only desired transformations so that the standard seminvariants 
Mr Ay Kr 


Fr) \dry) dr, «+ Might well be called statistical invariants. 
M2 Ag | ke 





11. Seminvariants and Invariants of Samples. Consideration of the defini- 
tion of seminvariants and invariants shows that: 
1. A seminvariant is a seminvariant not because it is a function of deviations 
from the mean, but because it is a function of the differences of the observations; 
2. An invariant is an invariant not because it is a seminvariant divided by 
the standard deviation raised to the proper power, but because it is a ratio of 
two seminvariants which are of the same order in powers of the observations. 
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These facts are important from the statistics viewpoint because they show 
that seminvariants and invariants of samples are also seminvariants and invari- 
ants of the population from which the samples are drawn. 


II. ESTIMATES 


1. Power Product Seminvariants. The Roberts Theorem set up a duality 
relationship between seminvariants expressed in terms of coefficients and semin- 
variants in terms of power sums. It can be shown that corresponding to each 
pair thus determined there exists a third seminvariant expressed in terms of 
power products. This leads to what may be called a triple system of semin- 
variants, the interrelationships being most apparent when all three seminvariants 
are expressed in terms of the notation defined by (11). The seminvariant 


3 
a3 3a2eq, , 2a; . : i 
me > + —z becomes in this notation 
ao ao ao 





(111) =3(11)(1) 


3 
4 ay 


n®) n@n ns . 








The corresponding power sum seminvariant is 


@) _ 3@() , 200) 


n n? n> ? 





while the power product seminvariant just mentioned is 


(3) _ 3(21) , 2(111) 





The value of the power product notation lies in the fact that the numerical 
coefficients of the three seminvariants are then identical, while this is not the 
case when monomial and elementary symmetric functions are used. 

Perhaps a few remarks are in order in regard to the proof of the relationship 
above expressed. The annihilator, corresponding to Q, for seminvariants in 
terms of roots is, see [2; 230-31], 

n 
ff} == Zz a S 
i=1 0a; 


It is easy to see that 


( 1’ pz? ++ eo 1 ; 7 r oe Te. 
i p| pepe ao ED LS aeppit pi «+ ET, pe — 1, +B, 


and also that, 





(pi' pi? --- prti'0) _ (n — p+ 1)(pi? +++ petit) _ (pi +++ prez) 
n) os nm died 
Since 
1 Fe erties 3 e 1 - ® ® i Pp 
a| (p2) 2) |- — Do ripi(pi)™*(pa)"* +++ (pps — 1) «++ (Dd, 
n ne §ml 
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and 
(pi)"*(po)"* «++ (pe-a)™*~*(0) _ m(pi)™*(po)™* «++ (pea)"** _ (pi)™* +++ (Dea) 


ne a —_— net : 
it becomes evident that corresponding to any power sum seminvariant there 
exists a power product seminvariant with the same numerical coefficients. The 
converse is also true. 


2. Unbiased Estimates of Rational Integral Moment Functions. If 7 repre- 
sents a population parameter, and if ¢ represents such a function of n observa- 
tions that the expected value of ¢ is equal to 7; then ¢ is said to be an unbiased 
estimate of 7. See Tschuprow [11; 74-75], Bertilsen [8; 144], and Fisher [12]. 

Let (pipe -- - p.) denote a power product computed from a sample, the sample 
being from an infinite population. Then it is well known that 


z| um +++ Ds) 


n® 


‘ / , 
> |= eon °° Mbps, 
n being the number of items in the sample. If FE‘ be interpreted as “unbiased 
estimate of,’’ the above relation may also be written 


(19 Epa boy ++ ap] = EP Pe) 
and it is seen at once that the power product seminvariants defined in section 1, 
if computed from a sample of n observations, are the unbiased estimates of the 
corresponding power sum seminvariants of the infinite population from which 
the sample is drawn. 

This provides an algebraic interpretation as well as a different approach to a 
topic which has already aroused considerable interest among statisticians. In 
1927 Bertilsen [8; 144] gave the estimates of the first four Thiele seminvariants 
of the population in terms of Thiele seminvariants of the sample. In 1929 
R. A. Fisher [12] also obtained these results and gave in addition the estimates 
of the fifth and sixth Thiele seminvariants. His results are in terms of sample 
moments. In 1937, P. S. Dwyer [13; 26] gave the estimates of the first five 
population central moments and indicated also means for obtaining the estimate 
of any rational integral isobaric moment function. 

In the remainder of this chapter 
(1) Dwyer’s method will be extended and perhaps somewhat simplified, 

(2) certain properties of this type of estimate will be pointed out, 
(3) estimates of all seminvariants of weight < 8 will be made available. 


3. Computation of Estimates. From the relationship (19) it is possible to 
write down immediately in a simple, although not immediately useful, form the 
estimate of any rational integral moment function. Thus the fourth Thiele 
seminvariant », is given by 


; e # V ea , 
Ne = wa — Susur — Bus + 12ygui” — bur‘, 
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so that the estimate of \, is 


z, = 4) — 48) _ 3022) , 12211) _ 6(1111) 


n n®) n® rn 








Since power products are difficult to compute directly, it is necessary to 
express the estimates in terms of power sums. Dwyer [6; 30-33] gave a com- 
plete discussion of the problem of expanding power products in terms of power 
sums and also gave tables of power products in terms of power sums for 
weights < 6. By use of (12) it is also possible to use tables giving monomial 
symmetric functions in terms of power sums. One table by J. R. Roe [14; 
plate 18] includes all cases of weight < 10. 

By use of such a table we find 


(31) = —(4) + (8)(1), 
(22) = —(4) + (2)(2), 
(211) = 2(4) — 2(8)(1) — (2)(2) + (2)(1)’, 
(1111) = —6(4) + 8(3)(1) + 3(2)(2) — 6(2)(1)’ + (1)*. 


If these results are substituted in L, above and like terms are collected, it is 
found that 


nL, = n?(n + 1)(4) — 4n(n + 1)(3)(1) — 3n(n — 1)(2)? + 12n(2)(1)? — 6(1), 
a result which agrees with that given by R. A. Fisher [12]. 


4. The Dwyer Double Expansion Theorem. The Dwyer double expansion 
theorem, [6; 34] and [11; 37-39], states that if any isobaric sum of power products 
of weight r indicated by 


r! : 
———_________————_ ,,...,* B® vse fs 
(20) a (qi!)"? ae (q:!)"* ™! Bic mi! q,1 are(Q1 Qt ) 
be expanded in terms of power sums in a form indicated by 


r! 


S a eer oO 


then the coefficient a, of the power sum (r) is given by 


oni TI cts 
(pi!)™? <ee (pe!)"* m1! -++ 1! 


and that the coefficient a,,...,,, of (r1)(rT2) --+ (Tm) is 


(22) a, = >(—1)*" 


eee r 
bor byt. ’ 


(23) Digit * Gilling *** Rays 


The barred product indicates a symbolic multiplication by suffixing of sub- 
scripts which is exemplified by 
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The application of this theorem to the present problem eliminates the use of 
tables and permits the independent computation of the coefficient of any particu- 
lar products of power sums in the expansion in terms of power sums of any given 
estimate. The illustration given by Dwyer [13; 39, 40] exemplifies both of 
these points very well. 


5. Estimates of all Seminvariants of Weight < 8. If the estimates of any 
‘complete system of seminvariants and all products of these seminvariants up 
to and including weight r are known, then the estimates of all seminvariants 
of weight < r are obtainable as a linear combination of these known estimates. 
For example, suppose that we know the estimates of all Thiele seminvariants 
of weight < 5 and wish to find the estimate of u;. Since us = As + 10AsA2 , 


zy [us] = M; = yg" [As] + 10E [AsA2] = Ls ad 10L2 ° 


In table II are given the estimates of all Thiele seminvariants and all products 
of Thiele seminvariants of weight < 8. From this table the expressions for L; 
and Le are obtained and, by taking the combination indicated above, it is 
seen that 


n® M, = (n* — 5n* + 10n”)(5) — 5(n* — 5n? + 10n)(4)(1) 
— 10(n® — n)(3)(2) + 10(n? — 4n + 8)(3)(1)” 
+ 30(n — 2)(2)°(1) — 10n(2)(1)* + 4(1)’, 


a result which checks with that given by Dwyer [13; 27]. In similar fashion 
the estimate of any other seminvariant of weight < 8 can be obtained by use 
of table II. 


6. Computation Checks. There are a number of checks which can be applied 
to the entries in table II. These may be of interest simply as properties of the 
estimates, and they may be of use in correcting errors which may possibly have 
crept into the tables. 

When any power product of more than one part is expanded into power 
sums, the sum of the numerical coefficients of the expansion is zero. To prove 
this we need only to consider a set of observations of which one observation is 
unity and the rest are all zero. Then any power product of two or more parts 
is necessarily zero and all power sums are equal to unity. Hence the initial 
statement of the paragraph follows immediately. 7 


From this fact it is apparent that the sum of the coefficients of L, is = , and 


the sum of the coefficients of L,,,,....., is zero. Thus for Ls, we have 


3 2 2 pe > co 
Ste te =e = , and for Le the sum of the 


coefficients is 


Sg l-m tnt 4n— 44? — 3n +3 — n+ 1] =0. 
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A condition satisfied by the coefficients of any seminvariant is that their sum 
, isequal to zero (See section 2). This provides another check on the entries of 
table II, although the seminvariant must be written in homogeneous form 
before the check is applied. Thus we may write 





ly = | ( +1) %-an4y) @)(1) 
n n 


n 
2 2 4 
— 3(n — 1) = + 12n = — 6n or 
and the sum of coefficients is 


(n+ 1) — 4(n + 1) — 3(nm — 1) + 12n — 6n = O. 


Several checks arise from the fact (see section 6) that every seminvariant 
must be annihilated by the operator 


— a 
24 / = 1 ——. 
(24) Q 2 is * 3a, 


Another check results from the discussion of the next section and is so apparent 
as to need no comment. 


All the checks mentioned in this section are applicable to the estimate of any 
seminvariant. 


7. Estimates as Sums of Simple Seminvariants. A seminvariant such as L, 
in which the coefficients of the m’’s are functions of n will be called a composite 
seminvariant, while a seminvariant in which the coefficients of the m’’s are 
purely numerical will be called simple. The fact that is to be established in 
this section is that every composite seminvariant is the sum of simple semin- 
variants. As an illustration consider L,. It is apparent that 


+ 3 
n n 
Le = Talat 7a he, 


where J, and k, are seminvariants of the sample corresponding to »4 and %. 
Both 1, and k, are simple seminvariants. 

That a composite seminvariant may always be expressed as a sum of simple 
seminvariants can be demonstrated by considering the effect of 2’, (24), on a 
composite seminvariant. The coefficients are polynomials in n and are un- 
affected by the operator. The expression resulting from application of the 
operator can vanish only if the coefficient of n’ vanishes for every r. Thus a 
composite seminvariant which has r different powers of n appearing in its coeffi- 
cients is expressible as the sum of r simple seminvariants, which are not neces- 
sarily distinct. Table III exhibits the estimates of Thiele seminvariants of 
weight < 6 as sums of simple seminvariants. 

Since the factors, appearing in front of each of the simple seminvariants in 
the expression resulting from breaking down a composite seminvariant, are of 
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TABLE III 
Estimates as Sums of Seminvariants 
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successively lower order with respect to n; it is possible to obtain approxima- 
tions of various orders to the value of an estimate by using the appropriate 
portion of the expression given in the table. 


8. The Estimates of the x’s. The seminvariant x, possesses an interesting 
property which will be called invariance under estimate. By this is meant that 
the estimate of x, is k, multiplied by a suitable factor. In particular, xe = we and 
k3 = m3 and it is well known that 
n® 


2 
E [ua] = “5m, Els] = 7, ms 


n®) 


so that the x, certainly possesses the property forr = 2and3. It can be shown, 
however, that 


‘ 


3 
n 
(25) Ko, = Oo i. Kor41 — n® Kor41. 


From (15) 


so that 


i Br ee Pm 2. 


By the Binet-Waring identities [15; 6-7] 
(26) (a-b) = (a)(b) — (a + B) 


and this holds for power products regardless of the values of aandb. Hence 


K, = = +i = = v(? r) oe — i) — (2r) 


n® 


oo), 
~ 2 ia. 1 (- n' (7) 22. 


n°?) 


> (- ? "=0-2+ 5 v(? "), 
(On 


the coefficient o above is ~ = i and it follows immediately that 


1 S\- 1)' C ) 9&9 - 5 Re. 


2 i=0 (2) 


This proves the first half of (25) and the second half can be proved in similar 
fashion, although with considerably more difficulty. 
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9. Other Simple Seminvariants which are Invariant under Estimate. It has 
been previously remarked (Chapter I, section 2) that the « system of semin- 
variants are the seminvariants of minimum degree, those of even weight being of 
second degree and those of odd weight being of third degree. The xo,’s are the 
only seminvariants of degree 2, but for odd weights greater than 7, there exist 
more than one seminvariant of degree 3. It is not difficult to show that these 
additional minimum degree seminvariants are also invariant under estimate. 
The type of proof used could have been applied equally well to obtain the results 
of the preceding section and indicates that the property of invariance under 
estimate which is possessed by the x’s is a direct result of their minimum degree 
property. 

Consider the estimate in power product form of any seminvariant of degree 3 
and odd weight. Power products of 1, 2 and 3 parts will appear. By the Binet- 
Waring identities each three part power product (abc) yields a third degree power 
sum product (a)(b)(c) plus other products of lower degree. Since (a)(b)(c) 
comes only from (abc) its coefficient must be identical with that of (abc) and will 
therefore be a constant divided by n®. The coefficient of each second degree 
product of power sums will be a sum of terms, the first of which comes from the 
corresponding two part power product with a coefficient identical with that of the 
power product, and the others come from the three part power products. Then 
the coefficient of a second degree product of power sums must be of the form 


a ~atat+-+a _ante 


n® n® n® 


Similarly the coefficient of the first degree power sum term will be of the form 


din? + dan + ds 
n® . 


Since the estimate of a seminvariant is a seminvariant, it follows that d; = 


This is true because the coefficient of ee) must be the coefficient of = 
multiplied by —r. Furthermore cz = dz = 0 for if the contrary be assumed it is 
immediately possible to break the composite seminvariant into two simple 
seminvariants, the first being of degree 3 (the original seminvariant) and the 
second of degree 2. Since for odd weights no seminvariant of degree 2 exists, 
it follows that any seminvariant of degree 3 and odd weight is invariant under 


estimate. It is also apparent that the factor n*/n must appear in the estimate. 


10. Composite Seminvariants which are Invariant under Estimate. For 
each weight r 2 4 there exists a composite seminvariant which is invariant 
under estimate. For weights 4 and 5 this seminvariant is easily obtained by use 
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of Table III. Thus for weight 4, form the seminvariant \, + cxA3. From the 
table we find that 


n'* 


ED + onda] = 7 & — C2 qa He ks 


ie: oe... 


ne a 





n* 2 n 2 2 
= =p (l + cools) + G(r? — nom). 


(2) 







2 
If co = n/n 
is 


(27) Ve= Mut B he 


the seminvariant is invariant under estimate. This seminvariant 







In similar fashion we find for weight 5 











(28) Vs = As +? om 


For weights > 5 considerably more difficulty is encountered. For weight 6, 
for example, we consider the seminvariant 


Ne + Coadade + Caz + Co22ds. 
By use of table III we obtain 


6 
E™ [Ns + cazdgdo + C333 + Co223] = a (ls + Calgle + c3313 + corel) + ®, 


















where ® is a sum of other seminvariants with coefficients which are functions of 
n and C42 , C33 , C22. Now there are only four linearly independent seminvariants 
of weight 6 and it is necessary that one of these involve the term (1)°/n°. By an 
argument analogous to that of the previous section this term cannot appear in 
# and therefore # is expressible in terms of three or fewer seminvariants. Ac- 
tually three are necessary and equating the coefficients of these to zero the values 
Of C42, C33 ANd Cog2 are uniquely determined. The result is somewhat lengthy 
and scarcely of sufficient interest to record here. 

The same sort of procedure can be used for determining seminvariants of 
higher order which are invariant under estimate, but the labor of computation 
becomes very great. 

It is possible to obtain moment functions which are invariant under estimate 
by means of a set of equations given by Dwyer [13; 38-39]. These equations 
connect the coefficients of a general isobaric moment function and the coefficients 
of the expected value of that function. In his notation if, for example, 


fa = a4(4) + 4451(3)(1) + 3a20(2)? + 6a2n(2)(1)? + aun(1)*, 
then 


E[fi] = by nus + 4bs 0” us mi + 3bo2n” yi? + bboy n® m2 M1 = n ad eT ’ 
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wherein: 
a4 + 403; + 3de2 + 621 + Quin bs , 
a31 + 3d + Quin ba, 
Qo + 2der + Aun = be , 
dor + Qun = ba , 


Qun = bun. 


The problem at hand demands that 


B| na = + 4n? as: (3) (1) 


2 4 
+ 6n* ae (2)() + n'aun oT 


ns nt 


+ 3n’ ar ail 


n? ne 
= [nag us + 4n asus m1 + 3n aro us” + 6n aon ws mi + nau ut] 

so that the equations (29) become 

noun ca dn aun , 

n* den = d\n (aon + ai), 

nae = d\n” (doz + 2de1 + Gi), 

nay = An (ag, + 3aen + aun), 

nds = AN(Ay + 40g, + 322 + 6den + A111), 


and from these equations a4, 3; , Q22 , G2, can be found in terms of ayy,. Ob- 
viously there is only one solution if none of the a’s are zero. In general, forany 
weight 7, a similar system of equations can be found and they determine the 
coefficients of a moment function of weight r which is invariant under estimate. 
It appears that this moment function is always a seminvariant although no 
proof of the fact has been found. The moment functions of weight 4, 5 and 6 
obtained by this method are identical with y , Ys and W- defined above. 


Conclusion. The results of this paper include: 

1. A demonstration of the fact that the theory of statistical seminvariants is 
identical with the theory of algebraic seminvariants. 

2. The introduction of new statistical seminvariants. 

3. Simplification of the computation of estimates. 

4. Proof that the estimate of any seminvariant is also a seminvariant. 

5. Proof of the existence of a trio of seminvariants with the same numerical 
coefficients. 

6. A discussion of seminvariants which are invariant under estimate. 

Many thanks are due Professor P. 8. Dwyer for his able guidance in the 
preparation of this paper and to Professors C. C. Craig and J. A. Nyswander for 
helpful comments. 
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THE ERRORS INVOLVED IN EVALUATING CORRELATION 
DETERMINANTS 


By Paut G. Hoe. 


1. Introduction. Many statistical problems require for their solution the 
evaluation of correlation determinants. The method usually employed for such 
evaluation is that of Chio,’ in which the order of the determinant is reduced by 
successive operations with selected pivotal elements. The repeated multiplica- 
tions and subtractions involved in the method necessitate rounding off the 
elements in the successively reduced determinants. The calculated value of the 
original determinant is therefore in error; and so the question naturally arises 
as to the magnitude of this error. 

Previous attempts to answer this question seem to be satisfied with finding 
an upper bound for the magnitude of the difference between the value of the 
original determinant and its value after its elements have been rounded off. 
Moreover, this bound is expressed in terms of the errors in the elements and the 
minors of the original determinant, whose values are assumed to be known 
exactly from calculation. However, several reductions are often needed before 
the value of the determinant can be obtained; and furthermore the minors are 
subject to the same type of errors as the determinant itself. The problem, 
therefore, is to find an upper bound for the magnitude of the difference between 
the final calculated value of the determinant and the determinant itself which 
involves only calculated quantities. 

This paper treats the problem from two different points of view. In the first 
part an upper bound is obtained for the magnitude of the error. In the second 
part the first order error terms are given more detailed consideration, with the 
result that an upper probability bound is obtained for the error. 








2. Absolute Bounds. Consider the correlation determinant A = |1r;;|. To 
evaluate A by the method of Chio, it is convenient to select diagonal elements 
as pivots. It will be assumed without loss of generality that the upper left 
diagonal element is always chosen as the pivotal element in each reduction. 
After each reduction, elements are rounded off to a fixed decimal accuracy. 
Let ak; represent the element 7,7 after the k-th reduction, ai; the difference 
between the rounded value of element a‘; and a‘; itself. After k reductions, we 
arrive at the determinant 


k k 
Ok+1k+1 + Th+1k+1 
ke . 
F* = 






" k k 
| Gua + Zac 


1See for example, Whittaker and Robinson Calculus of Observations, p. 71. 
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By treating F* as a function of the x‘, it may be expanded by Taylor’s formula 
as follows: 


n 1 n 
(1) FR=A*+ D0 xi A +o DL shite Alina + *--, 
i,jek+l 2! Retr 


where A* is the value of F* for all x‘ zero, Aj; is the cofactor of a‘; in A“, ete. 

For a determinant of order n, the value of the determinant obtained after a 
single reduction is the value of the original determinant multiplied by the 
n — 2 power of the pivotal element used. Applying this to F", it follows that 


A*® = (ake: + yy i ae 
Af, ia ue *s7;* 
Al ipa Hi** Fyne ’ 
etc., where the exponents of H; are ordinary exponents rather than notation. 
Substituting in (1), 
PY = HEP 4 yp > ok pe 1 Ht OY atct Fe, +... 
ki 2! k+1 ” si 
In order to express F* in terms of the original determinant, this expansion 
will be condensed by means of the following operational notation. 
(2) F=(1+D4+D?+.---+D")He*'F*™, 


where D‘ operates on Hj *'F*" by reducing the exponent of Hz? *" by i units, 
by summing from k + 1 to the product of 7 terms in x* with the corresponding 
cofactors of F*’, and dividing the result by factorial 7. Using this as a recursion 
formula, 


P=(1+D+.--.+D™)HE* "(14+ --- + DHT --. 

(1+ --- + D™“)HT'P. 
However, 

Qu + Zu 
P= <a = A, 
Ann + Inn 

since we assume that z;; = 0 for our original determinant. Consequently, 
(3) Fe= (1+... +D**)HE* (1+... + D/H... 

(1+ .-- + D™")HT“a. 


Since D* operates on F*” in (2) to extract the proper cofactor of 7 less rows than 
in F*, which in turn reduces the exponent of all factors Hx; in the expansion 
of F* by i units, D‘ reduces the exponent of all H’s following it in the expansion 
of F* in (3) by ¢ units. 
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Following these rules of operation, and expanding so as to collect terms of the 
same degree in the x’s, we may write 
F* = Hp’... HDA + Ap” ... Ay (terms in 2;;) + 
Hp  ... H? (terms in 2;;%p,) + ---. 
Letting H = HH, --- H,andC = Hz *" .-- H}’, we may write 


. S 
H IP 


(4) 


(terms in 2;;) + 


r=F--ca=cl 


(terms in 2ij;2%pq) + +f 


and hence 


(5) “a A= 7 (terms in 23) + iz (terms in 2ij;2%pq) +--+. 

Now J is the difference between the calculated value of A, using Chio’s reduc- 
tion method and rounding off after each reduction, and the true value of A. 
We are interested in finding an upper bound for the magnitude of J. To ac- 
complish this we shall first overestimate the number of terms in the various 
sums of (5), then find an upper bound for the magnitude of the terms in these 
sums, and finally combine the two results. 

In counting terms by means of (3), we may ignore the H’s since they merely 
serve as coefficients of the x’s. Therefore consider the nature of the terms in 


(l+---+D™*)1+---+D"*")... (+--+ D"™ a. 
1 n 

i 5 z Lig Lpg Aijng, ete.; 
+ n—s+l 


terms in 2;;2pq, etc. Each of these 


Now (1 + --- + D*)A contains the sums 2. rizA 
, n—st+l1 
27, 1\2 
hence it contains s’ terms in 2;; , == 
is not greater than s”, .2C2, etc.; consequently, the number of terms of each type 


is not greater than the coefficient of the corresponding power of D in the expan- 
sion of (1 + D)”. Therefore, 


(6) (1 + perma + ne eisl (1 + yn sts (1 + D)", 


where m = (n — k)? + --- + (n — 1)’, contains at least as many terms of each 
type as are found in the expansion of F*. This gives us the desired overestimate 
of the number of terms in the various sums of (5). 

In finding upper bounds for the magnitudes of terms, it is to be noted that (4) 
is written with all common factors extracted from each set of terms of the same 
degree in the z’s. In the parenthesis containing terms consisting of the product 
of r x’s, the first sum will have unity for its coefficient while the last sum will have 
H;Hi-1 --- H2 as coefficient, with all sums between having as coefficients prod- 
ucts of H’s with exponents < r. Hence an upper bound for all coefficients in 
this parenthesis may be written as H’, where H is the magnitude of the product 
of those H’s whose magnitude is greater than unity, but unity if none exceeds 
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unity. Now terms in 2;; are multiplied by Aj; , those in 2;;%p_¢ by Aijpq , ete.; 
therefore let A;; , Aijpg , etc., be the absolute values of the largest in magnitude 
of such cofactors. With this notation for upper bounds for magnitudes of 
terms, and (6) giving an upper bound for the number of terms, we may write an 
upper bound for the magnitude of J as follows: 


ry ry 2 
(7) | J | < G ) nC By + (7.) mC2Aiipg + eee, 


where ¢ > | x | is the maximum error of rounding. This result is valid for any 
determinant with real elements. All quantities on the right are available from 
calculations except the A; consequently this upper bound will be useful only if 
satisfactory bounds exist for the minors of the determinant. It can be shown 
that (7) holds for any minor of A, say Au» , if the A have wv added as subscripts; 
and therefore it may be applied to the question of the accuracy of least square 
solutions. 

For the correlation determinant A it can be shown that the magnitude of a 
minor of order n — k is bounded by k!/2" for k even and k!/2**~ for k odd. 


Setting a = : e and substituting these bounds in (7), 


|\J| <am+a a oa Oss Ie". Cass Faken. 
_ 3.3 4.4 
am ,am ,am 
< mae ie cee Oe See 
Saat +--+ tT 
a’ m* 
2(1 — am)’ 
for am < 1. Since am is obtainable from the calculations for A, this is the 
desired upper bound for the error in question. 


2 
(8) Sam +" + 


3. Probability Bounds. In order to find probability bounds for this error, 
it will be necessary to expand the H’s since they involve the variables x. Con- 
sider H, = akg’ + zjz'. Since az’ came from repeated reductions of A, it is 
expressible in terms of the z’s and the minors of A. To obtain this expansion of 
H;, consider 


ke k—s | 
Ak—s+1k—s+1 + Uk—s+i k—s+1 
@’ = 
atz 
8 +4 ae 


Using the same methods as for F", this may be written as 


= B+ = xi;* Bi; + 5 ts 247° 2'pq' Being Se 
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where B’ is the value of G* for all 2*~ 


zero, etc., and where B’ = Hj_.G**), 
Bi; = HiziGit', ete. Substituting, 


Gt = Hie + Hit Dai or + 2 WE DY ay ett + 
Using operational notation here also, this may be written as 
@=(1+E+E4+.--- + E’)HiRG", 


where the E’s operate the same as the D’s, except that sums are taken from 
k—s+1tokrather thanfromn —s+1ton. Treating this as a recursion 
formula, 


H.=G@=(1+ E/ ARI + E+ E)Hi2--- (1 +--. + BAT Ge. 
However, 
Qu + Zn | 


G = ty. - 











Ok + Lex | Akk 


Consequently, 
(9) He = (1+ E)Ha(l + E + B)Hes--- (1+ --- + BHT? hy 


Since the E’s operate on the following H’s to reduce their exponents, the number 
of terms of various types, that is, of various degrees in the z’s, will not be de- 
creased if the order of H’s is disregarded and their exponents held fixed. There- 
fore consider 


(10) Htr= (Ql +E (lt E+F).-- (1 +--- + BE YAA --- AY?’ 


as an ordinary recursion formula in the H’s for overestimating the number of 
terms of various types. If (10) is substituted for successive H’s within itself 
in a systematic manner until no H’s remain, it will be found that 


Hy = (1+ £)--- (1 +--- + E*")d 


11 
_ (1 + £)---(+--- + E**)A,2)” --- ((1 + E)As] 


gk-4 ok-3 


[4° . 

To merely count terms it is permissible to combine like terms to give 

Hi = (1+ or -r'"a +E+ eo eee HS E*)K 
=(1+ BE (1+ E+E) *... (+--+ EY YK, 


where K is the product of the A’s. Since the E’s operate like the D’s, the same 
arguments as those used to arrive at (6) may be used to replace (1 + EF + --- 
+ E’) by (1 + E)" for overestimating the number of terms. Hence, the number 
of terms of various types in H; is not greater than those in 


(1+ 2)" "(1+ 28)" *"*... (1+ E° 1 + EY = (1+ BE”, 
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where w, = 2" * + 2?.2** 4 ... + (k — 2)?.2° + (k — 1). Therefore the 
number of terms of various types in Hz *" ..- H?~? is not greater than in 


(12) (1 4 pre Ser etn sa (1 + EB)‘. 
It is easily shown that ¢ can be condensed into the form 
(13) ¢ = [2° %(n — k)—1] + 22" (nn — k)—1] + «+. +(k — 1)2%(n — k)— I]. 


From (3) it is evident that the number of terms of various types in F* will not 
be greater than those in the expansion of F* when the exponents of the H’s 
are held fixed. But from (6) we have an upper bound for the number of terms 
arising from the D’s, and from (12) those arising from the H’s; hence the number 
of terms in question will certainly be bounded by those in 


(14) (1 + D)™** = (1 + D)*. 


Now consider the magnitude of terms. The terms arising from the operation 
oft D’s contain minors of A as factors, while those arising from the operation 
of E’s contain minors of A; , where 7 ranges from 1 to k. Let A;;, etc., denote 
an upper bound for the magnitudes of all such minors of the same number of 
subscripts. It is easily shown that A’ with 2r subscripts is not less than the 
magnitude of the product of several minors whose subscripts total 2r in number. 
The terms of various types also contain as factors products of the constant 
terms in the H’s. The constant term in H;, which will be denoted by fh; , 
can be obtained from (11) by operating with all ones since it will be unaffected 
by disregarding the order of operation. Hence, 

hy = AcAr-sA-s --- A “AT”. 
Since the A; are principal minors of a positive definite determinant with no 
element greater than unity, h, has unity for an upper bound. Thus, an upper 
bound for the magnitude of any term in the product of 7 z’s will be e* times A’ 
with 22 subscripts. 

With upper bounds now available for the number of terms and the magni- 
tudes of terms, we are in a position to consider the complete expansion of J in 
which the coefficients of the z’s will be constants rather than H’s. Evidently 
the terms in z;; will come from the terms in 2;; of (4) with the H’s replaced by 
the constant terms in their expansions. If Z denotes these terms, then 


fa ... wo | & xij Aaj + he Do 287° Aaj 
(15) k+1 k 


+... + he e+ he So ahydu |. 


Now consider an upper bound for | IJ — Z|. Since J — Z involves only terms 
in the product of two or more z’s, we need consider an upper bound for such 
terms only. From the results of the two preceding paragraphs, we obtain 


| I a Z | < €' C2; ing + é C34 ipque + --:. 
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But from the paragraph containing (8), bounds are available for the A’; hence 


lI-Z\< 202+ ACs + 0.2 Pe 


3 
eu 
, _=@ 
‘aa "* 
for eu <1. Since Z is of order e, ® will ordinarily be small compared with Z; 
therefore consider the nature of the distribution of Z. 
If we write Z = a,%, + --- + a,x,, then, since the z’s are independently 
distributed with rectangular distributions, it is easily shown that pe = 


; > ai, a3 = 0, a = 3 — $ >> ai/(>. a)’. If the a; are approximately equal 


in magnitude, then a, is approximately equal to 3 — 1/p. But from (15) 

1(n — ky? + .-- + 4(n — 1)’, which is sufficiently large for determinants 
employing Chio’s method to justify the _— that Z is approximately 
normally distributed. Setting L = hg-*? .-- Ap, 


m= HE [(S abt 4 Das) +. sh swf (dak +4 45) | 


t<7 t<? 


< lin — K+ > e=9) -Fe-B 4... +6~-09 


< | m— w+ --+(n- 1) — 5 @n — k p]=¥, 


Hence, the probability is >.95 that |Z| < 2W. Since |IJ — Z| < 9, the 
probability is >.95 that |Z | < 2W + #; and therefore the probability is >.95 
that 


(16) wie. 
C 
This inequality will usually give a smaller bound for | J | than (8). How- 
ever, when A is small the H’s may be small, with the result that C will be small 
and (16) may not give a satisfactory bound for | J |. In such cases the bound 
given by (8) may not prove satisfactory either. 


4. Example. Consider a correlation determinant of order 7 in which the 
elements are accurate to 4 decimal places. If Chio’s reduction method is 
applied until a 2 rowed determinant is obtained, then n = 7, k = 5, e = .00005, 

= 90, nu = 176, V = .00005+/160/3, and we obtain from (8) that 


A | AY’ .00000005 
wee (7) eal’ Gi a) + (7) i — .0045 A/H 





ERRORS IN EVALUATING CORRELATION DETERMINANTS 65 


where H/H is obtained from calculations involved in evaluating the deter- 
minant. From (16) we obtain that the probability is >.95 that 
.0008 


lI <-—. 


The relative advantage of the second inequality over the first depends on the 
size of the pivotal elements, as does the usefulness of either inequality. 
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THE CUMULATIVE NUMBERS AND THEIR POLYNOMIALS 
By P. 8S. Dwyer 


In a recent paper [1] the author has shown how the moments of a distribution 
can be obtained from the last entries of cumulative columns with the use of 
multiplication by certain numbers. These numbers may be called ‘“‘cumulative 
numbers.”’ It is the aim of this paper to show how these numbers can be 
obtained from the expansion of 2° in terms of factorials of the s-th order and to 
demonstrate properties of the polynomials of which these numbers are the co- 
efficients. 


TABLE 1 
Successive Frequency Cumulations 


(1) (2) (3) (4) (5) (6) (7) (8) 
x Se Cc! Cc? Cc c c 
a+ 6 64 64 64 64 64 64 
a+5 192 256 320 384 448 512 
a+4 240 496 816 1200 1648 2160 
a+3 160 656 1472 2672 4320 6480 
a+2 60 716 2188 4860 9180 15660 
a+1 12 728 2916 7776 16956 32616 
a 1 729 3645 11421 28377 60993 


or NWR OLD & 


1. The values Ci(u,). We use the notation Ci(uz) of the previous paper 
[1,289] to express the columnar cumulated entries. The 7 indicates the order 
of the cumulation while the 7 indicates the number of the term, counting from 
the bottom of the column. Thus in Table I, which presents the cumulations 
of a frequency distribution used in the previous paper [1,289], Ci = 729; Ci = 
3645; C> = 2916; --- , Ci = 6480, etc. Nowif k + 1 values of z are spaced at 
unit distances and if the smallest value of z is 0, it can be shown that 


pete 1) 


k k k 
= du; G= DL (c+ 1)ue; G= Du; C= 
0 0 0 


z} 


k k om 
g= yp ets, . = 1) 
0 2! 0 2! 


and, in general, j > 0 andj + 1 >i, 


k . *\ (j) 
yp etiti-v 
z=0 


+1 
(1) Ci" = 7 
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Similarly if k values of z are spaced at unit distances and if the smallest value 
of zis 1, it can be shown that 


. 2 < — . >». &@4De 
=u; Ci=Dmu; Ch=DLie-lu; G=>* a ue 
1 2 1 , 1 3 
k ae k ven ae 
q+D=, Yes Che [S== 2) ue 
1 . 1 é 


and, in general, 7 > 0 andj + 1 >i, 


f a (3) 
(2) ci = Zeti-*. = 


It is to be noted that the coefficients of uz in (2) could be obtained from the 
coefficients of uz in (1) by the substitution z + 1 = 2’. 


2. The powers in terms of factorials of the s-th order. If the s-th powers can 
be expressed in terms of factorials of the s-th order (factorials having s factors) 
then the moments can be expressed in terms of the cumulations. For example 


$< 82 es 


so, from (1) 


k (2) 
Sy- Pet i+ > fe = C+ Ch. 


And since 


*9)\ (3) (3) (3) 
pe @+2) $i+0 8 tees 


k (3) 


k 3) (3) 
Lf = ees .o1Eeee fet Lo Fhe = CE + 4C$ + Ch. 
In general if 


An(e +s — 1) + An(z +s — 2) 
a cc 8 te 
s! ’ 
then 
k 


(4) x 2°f. = Aas" + AaCs" + --. + AyCiti +... + An Cr. 


while if the smallest value of z is 1, we have 
k 


(5) » L "fe = AaCi" + AaC;" + Tes + A.,;C;" + — + Ae". 


These quantities, A,;, in (4) and (5) are simply the coefficients of certain fac- 
torials of the s-th order in the expansion of z’s!. 
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These numbers, for small values of s, are easily obtained. It is possible to 
use the table and a recursion formula of a previous‘ paper [1,294—295] for larger 
values of s._ It is also possible to obtain these values, without involving cumula- 
tive theory, from (3) above. 

While doing this we make a more general approach by expanding (a + 2)’ 
in terms of these same factorials with the coefficients now functions of a. This 
is possible if we add an additional term, A.o(x + s), to the numerator of the 
right hand side of (3). We have then 


A(x + s)® + Aa(x +s — 1)” 
. — (8) ao (8) 
(6) (a+2)'= Pe See eae 


s! 


The determination of the values A,; can be accomplished by purely algebraic 
means by successive substitution of x = 0, 1, 2,---s. In this way we obtain 
s + 1 equations in s + 1 unknowns. For example when s = 2 

A ( 9)” + A ( 4)" A 
20\0 c+ 2 a(x + - 22 ot” 


(a+ x)= ee 


so that when x = 0, 1, 2, we have 


a aS Axa ; (a a 1)’ = 3A + Ag ; (a a . = 6A x + 3A fe Ag n 


The solution is Ao = a’ ; An = 2ab + 1; b> where b = 1 — a. It 
follows that 


ess” i Ai e” 


(a+ 2)’ ~ + (2ab + 1) —— ~+ "dll y and hence that 


. (a + 2)*f. = a°C? + (2ab + 102 + UC}, 
0 


as indicated in the previous paper [1,293]. 
When a = 0, then b = 1 and we have 


Lat, = C2 + C3 while when a = 1, b = O and the right 
hand side becomes Ci + C3. 


It follows that the general cumulative numbers might also be defined as the 
solutions of the s + 1 equations in the s + 1 unknowns obtained by placing 
z= 0,1, 2,---,sin (6). 


3. The evaluation of the cumulative numbers. Formal algebraic methods of 
evaluating equations (6) are somewhat tedious so we use finite difference theory 
to aid in finding the solution. As in the previous paper [1] we use the notation 


vzwhena <2 <a+ kl We then write, from (6) 
0 otherwise J 


Vor =. sand es = { 
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si(a + x)* = A(x + o + Aun(z + 8 — 1)“ 
(7) ew ae ances 
+--- + Aja t+s— i) +--+ + Aut”. 


s! when r = oy 


0 whenr ¥ Of' We have then 


We note further that ve + ny” - { 


8) va +5)" = Ay. 


It has been shown in the previous paper [1,292] that 


(9) v"a+j) = : -o(? : ) (a+ jt)’ 


and it appears that the cumulative numbers could be defined by (9). A useful 
recursion formula has been derived from (9) 


(10) V@+2) = (a+a)V'(a+z2)"'4+ (8 +1—a—2)V'(a+e2— 1)". 


4. The cumulative polynomials. We define the cumulative polynomials to 


be the polynomials obtained by using the cumulative numbers as coefficients. 
Thus when a = 0, 


P=y; Po=yty; Ps=yt4yt+y;Ps=yt lly + lly’ + y's ete. 


It is possible to derive a recursion formula for these polynomials. We use 
(10) with s replaced by s + 1 and a = O and get 


(11) Past jas Dvr? (a) y? ss r2V""'(x)*y? + X(s + 9 pe x)V""(a - 1)'9’, 
which becomes, after some manipulation, 
(12) Poy = (1 — y)Eav""(a)"y® + (8 + LyP.. 


To illustrate we get P, from P; = y + 4y? + y*°. Now E2Vi(x)*y" =yt 


8y° + 3y’ and Py = (1 — y)(y + 8y’ + 3y*) + 4y(y + 4y° + y*) = y + Ly’? + 
4y° + y‘. The recursion formula (12) can be expressed also in the form of a 


differential equation, since P; = : (P.) = raV""*(a)'y*", as 
y 


(13) Puy = yf(1 — y)Ps + (8 + IPI. 


It can be shown more generally that for any a 
Pog = 1; Pai = a t+ by; Pao = a’ + (2ab + 1)y + b’y’, etc. with 


(14) Passi = yl — y)Pae + [a(1 — y) + (8s + Ly) Pans 


as the recursion formula. 


5. The numerator coefficients in successive derivatives of the logistic function. 
Lotka has recently exhibited the coefficients of the numerator terms of suc- 
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cessive derivatives of the logistic function [2, 160]. These appear to be, aside 
from sign, the same as the cumulative numbers when a = 0. It is shown in 
this section that these numbers are the cumulative numbers. The scheme is 
generalized to include the numerator coefficients of the derivatives of a more 


general function involving the parameter a. 


rt 
Lotka used the function @) = = 


(1 + ert)?’ eo) — 
Pe — | | ) 
ape? etc. The numerical coefficients are the same if r = 1 so we might 


1 : 
ise and obtained 4, = 


as well use yp = ae . Amore general function is the two parameter function 


e 
(15) d= er 


Let successive derivatives with respect to x be indicated by ®g,c,1 ; ®a,c,2 ; Pa,ec,s } 
ete. Then 
i e“[a + c(1l — a)e’] 
a,¢c,1 ( + ce*)? ? 
— e*[a? + (—2a? + 2a + 1)ce* + (1 — a)*c*e*] 
_ (1 + ce*)3 
In general, 


Daic,s i et = e* Qa.e,e (1 + ce”) 
so that 


(1 + ce*)*+2 


Do c,41 - 


and 


(16) Ouc.008 = (1 + ce”) [aQa.c,s + 2 _— (s + 1)ce* Qa.c,s . 


The Q functions can be changed to polynomials with the substitution e* = y. 
Then derivatives are taken with respect to y and 


(17) Pa,c.o+1 = (1 + cy)|aPa,c,s + yP4:c.] — (s + 1)cyPa.c,s . 


When c = —1, this becomes formula (14) and since Po,o = 1, it follows that 
the numbers of the present section are generalized cumulative numbers. When 
c = 1 anda = 0 we have the numbers found by Lotka. 

It can be shown, further, that the c coefficient of y’ isc’. It follows that the 
absolute values of the coefficients, when c = 1 and whenc = —1, are the same. 


6. Formulas for 2z°. A formula for the sums of the s-th powers of the 
integers from 1 to k is obtained by summing (3). We get 
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k k (s) 
: (c+s-—1) 
2 x = And ———__,—_ + 


s! 
k — s\le 
+ Aw D ats. + 
from which 


> " = (k 4 o wee 
sl 


(s + 1)! 
(19) (k + - es er» 
eT Ese” Ce er 


a 8 @+s-7"" _ 1 
2 Au Geil ~GFDIE 


«= cy” (1)V°""(3)’. 


j=l 


_ (k +e- perrvr(jy 
(20) 


For example . 
Ft a= EAD + HAY _ Wk + VR + 1) 
1 3! 6 : 
Ha HHI 44+ HO + H+) _ E+ DV! 
1 


4! 4 
a+k 
More generally the values of > 2’ can be evaluated by 


a+k 
@) Ler = pt eet s— pve +s = Lose + 7 
a j=0 
7. Summary. It is shown how the cumulative numbers and the cumulative 
polynomials may be obtained in a variety of ways. Of special interest is the 
fact that the cumulative numbers can be obtained by expanding powers in 
terms of factorials and hence they might be called factorial coefficients of a 
kind. It is also possible, though it is not within the scope of this paper, to 
establish interesting relations between the cumulative numbers and the multi- 
nomial coefficients, the usual factorial coefficients, the difference of 0, etc. 
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ENUMERATION AND CONSTRUCTION OF BALANCED INCOMPLETE 
BLOCK CONFIGURATIONS '* 


By GERTRUDE M. Cox 


1. Intzoduction. One of the general problems of experimental design is to 
avoid extraneous effects in making desired comparisons. ‘The method employed 
is to use experimental materials as nearly homogeneous as possible. Such 
materials, however, are seldom available in large quantities. On the contrary, 
field soils vary in fertility from block to block, animals vary with both litter and 
sex, and leaves on one young plant differ from those on another. Differences 
between blocks, between litters and sex, and between plants, being irrelevant 
to the comparisons usually contemplated, must be avoided. 

When the number of treatments to be compared is small, well known methods 
of design, such as the Latin square or randomized complete block, are available 
and efficient. As the number of treatments increases, however, these designs 
tend to become less efficient through failure to eliminate heterogeneity. Fur- 
thermore, they become cumbersome, the Latin square design requiring replicates 
equal in number to the treatments and the complete block design providing that 
each treatment occur in every block. (Blocks are defined as an assemblage of 
experimental units chosen to be as nearly alike as possible.) 

Because of such limitations, several modifications of the complete block design 
have been devised. These new designs all have the common characteristic that 
the experimental material is divided into groups or blocks containing fewer units 
than the number of treatments to be compared. These more homogeneous 
small blocks are referred to as incomplete blocks. 

It is desirable to have all comparisons between pairs of treatments made with 
equal accuracy. This requires of the design that every pair of treatments 
occur in the same block an equal number of times. Such a design is referred to 
as balanced. Balanced incomplete block designs can be arranged (for any given 
number of treatments) only for certain combinations of block size and number of 
replications.” 

The construction of balanced incomplete block designs is mathematically a 
part of the theory of configurations. A configuration is an assemblage of 
elements into sets, each element occurring in the same number of sets, and each 


1 A revision of an expository paper presented under a different title at a joint meeting 
of the Institute of Mathematical Statistics and Biometric Section of the American Statisti- 
cal Association, December 27, 1939. 

2 Numerous additional designs are available in the partially balanced incomplete blocks 


(3). 
72 
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set containing the same number of elements. The configurations to be con- 
sidered here are the complete configurations, i.e., those in which each element 
occurs an equal number of times in the same set with every other element. It 
would be useful to know, (a) what configurations (within the useful range) 
exist. (b) how these configurations may be constructed. 

The typical requirement of the experimenter is this: “I wish to test ¢ treat- 
ments and can use blocks of size k(t > k). I should like a design which will 
involve as little experimental material as feasible.’”’ The designer must then 
determine what configuration of ¢ elements in sets of k will satisfy the incidence 
relation that each pair of elements occur together in a set an equal number of 
times, and for which the total number of sets is a minimum. There are still 
many configurations which the experimenter needs but which have not as yet 
been constructed. 

In order better to explain the construction of these balanced incomplete block 
designs, it is essential to specify the underlying combinatorial problems. A 
configuration satisfying the condition of balance can be obtained by writing 
down all possible combinations, b, of the ¢ elements taken k at a time, 


= 
~ Rt — BD! 


The simplest example is that in which each set contains only two elements and 
all possible combinations of the ¢ elements, taken in pairs, appear in the different 
sets. This series of pairs can be written out by the experimenter, and the 
method of analysis is given by Yates [20]. 

Let us take another example; given six elements to be taken three at a time, 


6! 
~ 313! 


b= CC; 


b = 6C3 20. 


The 20 combinations are, 


123 134 146 236 845 
124 135 156 245 346 
125 136 234 246 356 
126 145 235 256 456. 


Such unreduced designs are not necessarily economical or feasible in experimental 
work. It is often desirable to find some less extensive configuration. In this 
example half of the combinations, either those in italics or the other half, fulfill 
the restriction that every element occur with every other element in the same 
number of sets. Each pair of elements occurs twice in either group of sets. 
Thus, a balanced incomplete block design can be based on either half of the 
20 sets as well as on all 20. 


2. Combinatorial methods. Combinatorial considerations of a simple nature 
enable us to set up necessary conditions which balanced designs must satisfy. 
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We have ¢ elements arranged in b sets of k elements each; each element occurs inr 


sets, and each pair of elements occurs together in a set exactly \ times. Then 
we must have 


tr= bk, rik —1) = X(t — 1). 


The first of these equations expresses the fact that the total number of plots 
must be equal both to the product of elements by replications and to the product 
of sets by number of elements per set; the second, that the number of pairs into 
which a given element enters must equal A times the remaining number of 
elements. 

It is convenient to write 


a Mt=-D ye = 1) 


ona * ~ k(k — 1)° 


Since the numbers f, b, r, k, \ must be integers, it is easy to obtain lower limits 
for any three in terms of the other two. 

To give a general classification, the configurations have been divided into 
classes according to the value of }. Because of the practical limitations in 
experimentation, table I has been expanded only to include \ = 6 and the k 
values from 1-14. It may be well to call attention to the fact that duplications 
occur in the different classes of table I. For instance in the class, \ = 1, for 
k = 6,t = 15m + 1, and m = 1, thenb = 8,andr = 3. Inorder to construct a 
design, the following condition is necessary; r > k and therefore b > t. In this 
example, the condition is met if b, r and \ are multiplied by 2, the resulting design 
ist = 16,b = 16,r = 6,k = 6anddA = 2. This configuration is a duplicate 
of the design in the class, \ = 2, fork = 6 and m = 1. In many of the con- 
figurations where is 3, 4, 5, or 6, a common factor can be cancelled from b, r and 
\ giving a design listed in the classes , \ = 1, 2 or 3. 

It should be emphasized that the conditions under which table I was derived 
are necessary, but not sufficient, for the existence of a complete configuration. 
For example, consider the following configurations which satisfy the necessary 
conditions for a design. 

Sub class 

(table I) m b r 
10m+ 5 1 21 7 
2im+ 1 1 22 7 
15m + 6 2 42 7 
42m+ 1 1 43 7 
45m + 10 2 1 
110m + 1 1 1 


100 110 1 
111 111 1 


1 
1 
No configurations of the above specification can actually be constructed. 


A selected group of configurations from table I is given in table II. Only 
those configurations whose k, r and A lie within practical limits, and whose 
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existence has not been disproved, have been included. The practical limits of 
k, r and X, of course, are dependent upon the conditions surrounding the experi- 
ment. We have chosen to keep k within the range 3 to 10 except for a few special 
configurations in which ¢ is greater than 100, in which cases k was allowed to 
equal 11-14. Also r has been kept within a similar limited range. (Those 
configurations in table II, with an asterisk preceding ¢, have not been con- 
structed.) 

The above limitations upon k and r give a small, selected group of configura- 
tions. However, many others either have been constructed or are known to 
exist. For balanced incomplete block designs, Yates [20] gives the lower limits 
of r for ¢t from 4 to 25 and k from 2 to 12 but not greater than 3¢. Fisher and 
Yates [8] have tabulated the configurations which are known to exist having 
ten or less replications including all arithmetically possible configurations the 
existence of which has not been disproved. 

Even if the existence of a configuration has not been disproved, there still 
remains the difficult problem of writing out the elements which are to appear in 
each set. Some discussion of the structure of such configurations is presented 
by Fisher and Yates [8] by Yates [20, 21] by Goulden [9, 10] and by Bose [4]. 
Additional descriptions are to follow. 

While a search of the literature revealed a number of constructed configura- 
tions, yet the general theory of their formation has received relatively little 
consideration. The question of combinations related to the theory of configura- 
tions which is of interest here was first set forth by Kirkman [11] in 1847. He 
states the problem thus: “If Q, denote the greatest number of triads that can be 
formed with x symbols, so that no duad shall be twice employed, then 


3Q: = x(x = 1)/2 - Ve 


if for Vz we put 0, when z = 6m + 1 or 6m + 3.” This gives the formula for 
which was given earlier in this article. Putz = tand V, =0 


t(é-—1)  e¢é-—1) 

3-2  kk-—41)° 

Besides the theory connected with these combinatorial problems, considerable 
information related to the construction of the configurations has been found in 
the literature on finite projective geometry, especially the geometry which applies 
to the theory of groups. 

An extensive discussion of the \ = 1 class of configurations (as listed in table I) 
can be found in the literature. The theory of the formation of the configurations 
for the sub-class t = 6m + 3 has been summarized by Ball [1]. This is the 
Kirkman “school-girl problem” for which Eckenstein [7] lists 48 papers and 5 
books written during the years 1847-1911 dealing with this subject. The 
problem was first published in the Lady’s and Gentleman’s Diary for 1850 [12]. 
It is usually stated that ‘‘a schoolmistress was in the habit of taking her girls 
for a daily walk. The girls were fifteen in number, and were arranged in five 
rows of three each, so that each girl might have two companions. The problem 


b=Q: = 
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is to dispose of them so that for seven consecutive days no girl will walk with any 
of her school-fellows in any triplet more than onee.’”’ For this particular sub- 
class ({ = 6m + 3, k = 3), this type of configuration has been shown to exist 


TABLE II 
Selected Group of Configurations 
(Balanced Incomplete Block Designs) 
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1 Youden squares. 
* Lattice squares. 


for every possible value of t. Most of the solutions were worked by H. E. 
Dudeney and O. Eckenstein. They are given by Ball [1] for all ?’s less than 100, 
that is, for ¢ = 9, 15, 21, 27, 33, 39, 45, 51, 57, 63, 69, 75, 81, 87, 93 and 99. 
Ball describes several methods of constructing such configurations, as cycles, 
combinations of cycles, scalene triangles inscribed in the circle, focal and analyti- 
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cal methods. As an illlustration of the school-girl problem, the construction 
of the configuration for t = 9, b = 12,r = 4,k = 3 and A = 1 will be shown, 
Scalene triangles are inscribe’ in a circle with certain specifications (to be 
fulfilled) giving the three sets of triplets for the first day as follows, 


Set Group I 


(1) k 1 5 
(2) 3 4 6 
(3) 7 8 2. 


By rotation or by cyclic substitution the other three groups are secured: 
Set Group II Group III Group IV 


(4) (7) ks 7 (10) k 4 8 
(5) (8) 5 S (11) 6 
(6) (9) 1 4, (12) 2 


Then placing k = 9, we have the configuration for ¢ = 9, b = 12, andr = 4, 
Note that in the school-girl problem the sets are grouped into complete replica- § 
tions of the elements. This problem of 9 girls taken 3 at a time has been sub- 
jected to an exhaustive examination. There are 840 arrangements but only one 
fundamental solution. In the case of 15 girls, the number of fundamental 
solutions according to Mulden [14] and Cole [6], is seven. Ball mentions the 
Kirkman problem in quartets which is the sub-class t = 12m + 4, for k = 4. 
He states that this has been solved for cases where m does not exceed 49. He 
also states, ‘I conjecture that similar methods are applicable to corresponding 
problems about quintets, sextets, etc.” 

Before leaving the school-girl problem, an illustration will be given of ¢ = 28, 
b = 68,r = 9,k = 4andA = 1. The following framework was set up by Dr. 
C. P. Winsor using suggestions from Netto [15]. 


7 1 
3 5. 


6 
2 


k a S» c 


ay, ag bs be 
ae a7 bi bs 
a3 dg C4 C5 
a on) Cy Cs 
be bz C3 Ce 
by bs C2 Cz. 


a, b and c each have every internal difference once and only once; and each pair 
a-b, a-c and b-c must have every external difference once and only once. The 
nine groups are given in table III. The cyclic substitution is within three sets, 
a,bandc. That is, 
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ingroup I, a=1 
in group II, a= 2 
in group III, a=3 

etc. 


, 4 = 2, a = 3, ag 
’ Qa = ’ ae = ’ oe ag 
» Q&T A= 5, ag 


Netto [15] discusses ¢ elements in sets of k, every set of 2 elements to occur 
together in a set exactly \ times. He deals with A = 1, and gives a discussion 
of both sub-classes when k = 3, that is, fort = 6m + landt = 6m+3. Reiss 
[16] and Moore [13] have proved that configurations can be constructed for all 
values of tifk = 3. This is the type of information which is valuable in answer- 


TABLE III 
Configuration for t = 28, b = 68,r =9,k =4,rX\ = 1 
Group I Group II Group III Group IV 


10 19 11 20 3 12 22 


28 
2 13 16 14 17 2 15 10 
3 11 18 12 10 13 12 
4 23 24 24 25 9 25 27 
20 27 21 19 8 22 21 
22 25 23 14 10 24 15 11 19 
21 26 22 16 17 23 17 18 20 


Group VI Group VII Group VIII Group IX 


23 15 24 || 28 16 25 17 18 
11 5 18 12] 8 6 10 13 7 ll 12 
13 16 14); 9 17 15 18 10 
19 3 19 20 1 4 20 21 5 21 6 22 
22 2 25 23 || 2 3 26 24 4 27 5 19 
20 17 13 27 21 || 18 14 19 22) 10 15 20 11 16 21 


21 10 11 26 22 || 11 12 27 23 || 12 13 19 13 14 20 
Lcsnliasurneessp eset lnesenteaiiiaspetisieiailescenanabinivaeacaeachinsa a oninsraendaeicnianhtaiacieadecaanimanisanmennenial 


ing the first question in the introduction of this article; “what configurations 
exist?” Carmichael [5] mentions the quadruple systems 6m + 2 and 6m + 4 
and states that the general problem of their existence appears not to have been 
solved. Also for the higher values of k there seems to be very little known of 
any generality, but it is known that for k > 3 there are certain configurations 
which are not possible. 


3. The method of geometrical configuration. Another aid in the construction 
of balanced incomplete block designs is found in some of the finite projective 
geometries. These are described by Carmichael [5]. A tactical configuration 
of rank two is defined as a combination of | elements into m sets, each set con- 
taining \ distinct elements, and each element occurring in y distinct sets, 





GERTRUDE M. COX 


= (t) = number of points in the geometry, 
(b) = number of lines, 
(k) = m mber of points, 
a = (r) = number of lines on a point. 


The series of finite projective geometries PG(x, p”) for « > 1 furnishes a 
certain infinite class of these tactical configurations. The following list gives 
those which have been incorporated in the list (table II) of useful balanced 
incomplete block designs. 


Two dimensional space, PG(2, p”) 
L(t) m(b) A(k) 


7 7 
13 13 
21 21 
31 3l 
57 57 
73 73 
91 91 10 
133 133 12 
183 183 14 


Three dimensional space, PG(3, p”) 
l m A Be 


2 15 35 7 3. 


From the Euclidean geometry EG(x, p") for x > 1 other tactical configurations 
can be constructed. These are formed from the PG(x, p") by omitting a given 
line from the two dimensional space and a plane from the three dimensional 
space configurations. Some of the resulting designs are: 


Two dimensional space, EG(2, p”) 


m A 


< 
3 


6 
12 
20 
30 
56 
72 
90 


tw 


—_ 
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Methods are available for constructing the two dimensional space PG(x, p”) 
and the corresponding EG(x, p") configurations where p is a prime number. 
This being true, we can also construct the completely orthogonalized squares 
from the EG(x, p") geometry. The reverse situation in which these configura- 
tions are constructed by using the completely orthogonalized squares is to be 
illustrated. These squares consist of superimposed Latin squares, fulfilling the 
condition that each number from the second Latin square occurs once and only 
once with each number in the first Latin square. As an example take the two 
Latin squares: 


Latin Square I Latin Square II 


1 1 3 2 
2 2 1 3 
3 3 2 1. 


Superimpose square II upon square I to get the completely orthogonalized 
3 x 3 square, 


11 23 32 
22 3l 13 
33 12 21. 


The first number in each cell is a value from square I; the second number in each 
cell is from square II. Note that the numbers in the second place in each cell 
occur once and only once with each of the first numbers, that is 1-1, 1-3, and 1-2. 


The completely orthogonalized squares have been proven to exist for all prime 
numbers and for powers of prime numbers. The solution of this problem was 
secured independently by Bose [2] and by Stevens [18]. Those of sides 2, 2”, 2°, 
2°, 2°, 2°, 3,37, 3°, 3°, 5, 5°, 5°, 7, 77, 11 and 13 have been given. 

The completely orthogonalized 3 x 3 square may be used to construct 











a balanced incomplete block design. The italic numbers, which follow the 
cell numbers, designate the 9 elements which are to be arranged in four groups of 
three sets. Group I is formed by placing the elements from each row into sepa- 
rate sets, in group II the elements from the three columns are placed in three 
sets; in group III the first set (7) consists of the elements which follow 1 in the 
first place in the cells, set (8) consists of the elements which follow 2 in the first 
place in the cells; and group IV is assembled in the same way as group III except 
the numbers in the second place in the cells are used to select the elements for 
each set. Thus-we have the configuration: 
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Group I Group II Group III Group IV 
Set (rows) (columns) (first place) (second place) 


(1) 1 4 7 (4) 1 2 8 (7) 1 6 8 (10) 1 5 9 
(2) 2 5 8 (5) 4 5 6 (8) 2 4 9 (ll) 2 6 7 
(3) 3 6 9 (6) 7 8 9 (9) 3 5 7 (12) 3 4 8 


In the 12 sets of 3 elements, each of the 9 elements occurs with every other 
element once and only once in a set. 

This is an illustration of one series of configurations which can be constructed 
with the aid of the completely orthogonalized squares. These are the EG(x, p”) 
in two dimensional space when x = 2 and p” = 2, 3, 2”, 5, 7, 2°, 3°, 11, 13,... 
The PG(x, p") configurations can be written by adding (k +1) elements 
to the previous group of configurations. For example, the elements 10, 11,12 
and 13 may be added to the groups, one to each group. That is, 10 is added to 
each set in group I, 11 is added to each set in group II, 12 to group III and 13 to 
group IV. An additional set must be added to include these four new elements. 
A configuration fort = 13, b = 13,k = 4,r = 4andd = 1 results. 


The 13 sets are made up of 4 elements each. These designs are symmetrical 


for sets and elements, that is, every pair of elements occurs together in the same 
number of sets, also, every pair of sets has the same number of elements in 
common. Discussion of the construction of these designs with illustrations are 
given in references [20, 8, 9] and [19]. 

In the PG(x, p”) series of designs, as constructed by means of completely 
orthogonalized squares, the sets cannot be arranged in replication groups. How- 
ever, these configurations can be arranged in Youden squares [22] in which all 
the sets are placed side by side and all the elements in a single row form a com- 
plete replication. This method of arrangement has been of considerable value 
in experimentation with plants. The Youden squares are the PG(x, p”) when 
x = 2. Singer [17] gives a partial list of the (reduced) perfect difference sets 
(table IV), only a single set for each p". The number of distinct perfect differ- 
ence sets (or the number of distinct perfect partitions) for a given p” is equal to 
¢(q)/3n. Since each perfect difference set can be paired with its inverse, the 
number is even. 

The construction of one of the Youden squares from its perfect difference set 
will be illustrated. Consider p” = 3 then g = p°'" + p?+1=3' +341 = 13. 
There are two perfect difference sets with their inverses for g = 13. One perfect 
difference set is 0, 1, 3, 9 which has the perfect partition 1, 2, 6, 4 which will 
add in succession to each number from 1 to and including 13, and also 1, 2, 6,4 
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add to 13. The elements of the perfect difference set are put in set (1) except 
that 13 replaces 0. Set (2) is secured by a one-step cyclic substitution, 1 for 
13, 2for 1,4 for3and10for9. This process is continued until there are thirteen 
sets. If the substitution is applied to set (13), the elements in set (1) are secured. 


Set 
(1) (2) (3) (4) (5) (© (7) (8) (9) (10) (11) (12) (13) 


Replica- A 13 1 2 3 45 6 7 8 9 10 11 12 
tion B 1 2 3 4 56 7 8 9 10 11 12 18 
C 3 4 5 6 7 8 9 10 11 12 13 1 2 

D 9 10 11 12 1312 3 4 5 6 7 8. 


This is the Youden square for t = 13,b = 18,r=4,k = 4, andA = 1. The 
elements in each row form a complete replication. 


TABLE IV 
Singer’s list of perfect difference sets 


_ Perfect difference set _ 





36 63 
63 116 127 136 181 194 204 233 238 255 


56 77 «81 
43 § 


38 88 104 109 
42 82 


;Wwwwwwww Pr Ww) 
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A third series of configurations, called Lattice squares or quasi-Latin squares 
[21] can be constructed by using the completely orthogonalized squares. The 
groups of sets on page 78 are taken in pairs. For each pair a square is constructed 
having its rows formed by the sets of one group and its columns by the sets of 
another group. For example, square I below is made so that the sets of group I 
form the rows and the sets of group II form the columns. Square II is the 
combination of groups III and IV. 


Square I Square IT 














6 
2 
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In this lattice square each pair of elements occurs together once only in cither a 
row or a column of cither one of the squares. Also, every clement occurs with 
every other element once in one column and one row from cach square. 

A device known as “complements” gives several configurations. From an 
arrangement having k ¥ 3t, a second one can be obtained for the same number 
of elements, in sets of t — k units. This is done by replacing each set by its 
complement, that is, by a set containing all the elements missing from the 
original set. An illustration follows: 


7, b=7 
=3, k=3 
A=1 


Set Set 


(1) (1) 6 
(2) (2) 6 
(3) (3) 9 
(4) (4) 3 
(5) (5) 4 
(6) (6) 4 
(7) ; (7) 4 5 6. 


No or WN 
eS IO OP W DO 


While the triple systems, quadruple systems, ete., which have been con- 
sidered by some mathematicians, do furnish designs meeting the balance re- 
quirements, they are usually not suitable for experimental purposes. A quad- 


-ruple system requires that every possible triple of elements occur once and only 
once together in a block. Since we need only every pair together once (A = 1) 
or more, only the triple systems are generally useful. 


4. Summary. The mathematical theory of configuration has been helpful 
in the construction of the balanced incomplete block designs. It would be use- 
ful to know (a) what configurations (within the useful range) exist, (b) how these 
configurations may be constructed. In table I the configurations have been 
classified according to the value of A, while in table II configurations within a 
useful range have been listed. Of the designs in this table which have not been 
constructed, some are known to exist. Those aids which have been used in the 
construction of the balanced incomplete block designs have been briefly dis- 
cussed. 
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A COMPARISON OF ALTERNATIVE TESTS OF SIGNIFICANCE FOR 
THE PROBLEM OF m RANKINGS’ 


By Mitton FRIEDMAN 


A paper published in 1937 [2] suggested that the consilience of a number of 
sets of ranks can be tested by computing a statistic designated x?. A mathe- 
matical proof by 8. 8. Wilks demonstrated that the distribution of x7 approaches 
the ordinary x” distribution as the number of sets of ranks increases. The 
rapidity with which this limiting distribution is approached was investigated by 
obtaining the exact distributions of x for a number of special cases. It was 
concluded that “when the number of sets of ranks is moderately large (say 
greater than 5 for four or more ranks) the significance of x; can be tested by 
reference to the available x’ tables” [2, p. 695]. The use of the normal distribu- 
tion was recommended when the number of ranks in each set is large, but the 
number of sets of ranks is small, although no rigorous justification of this pro- 
cedure was presented. 

Except for the few special cases for which exact distributions were given, the 
paper did not provide a test of significance for data involving less than six sets of 
ranks and a small or moderate number of ranks in each set. This important 
gap has now been filled by M. G. Kendall and B. Babington Smith [1]. In 
addition, they furnish a somewhat more exact test of significance for tables of 
ranks for which the earlier article recommended the use of the x’ distribution. 

Kendall and Smith use a different statistic, W, defined as x7 divided by its 
maximum value, m(n — 1), where n is the number of items ranked, and m the 
number of sets of ranks.” The new statistic (independently suggested by W. 
Allen Wallis [3] who terms it the rank correlation ratio and denotes it by 77) is 
thus not fundamentally different from x7. A more radical innovation is the 
improvement in the test of significance that they suggest. Instead of testing 
x? by reference to the x’ distribution for n — 1 degrees of freedom, Kendall and 
Smith, generalizing from the first four moments of W, recommend that the 
significance of W be tested by reference to the analysis of variance distribution 
(Fisher’s z-distribution) with z = ! 10g, (S—4 = Or), m = (n—1) - . m= 

2 1— W m 
(m — 1 | —-1)- . For small values of m and n, they introduce con- 


1 The author is indebted to Mr. W. Allen Wallis for valuable criticism and to Miss Edna 
R. Ehrenberg for computational assistance. 
2 This is Kendall and Smith’s notation which will be used in the present paper. The 


original paper [2] designated the number of items ranked by p, and the number of sets of 
ranks by n. 
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, fais , 12 = 
tinuity corrections, substituting for W = n(n e n)’ the statistic 
12 
W-— — 
, a, 5-1 a m2(n3 — n) 
iia... ,., 8 .* 
2. ) + [+> 


12 


m?(n* — n) 
where S is the observed sum of squares of the deviations of sums of ranks from 
the mean value, m(n + 1)/2. Comparison with exact distributions of W (or S) 
for special cases indicates that this test yields very good approximations to the 
correct probabilities. 
In the limit the two tests of significance are identical. Neglecting the 
(m — 1)x? 


correction for continuity, z log ( )- log ( xr ) n 
0 y ‘ = — ® ee a = = — 
2 m(n — 1) — x; 2 tn“ 


(m — | m= 1) - | oc,andn = (n—1) — . —(n-—l)asm—o. For 
m m 


m =, the analysis of variance distribution is identical with the distribution 
2 


n 
a single (limiting) distribution for all values of m, whereas the other, z, adapts 
the distribution to the value of m. 

The necessity of taking into account the value of m, while it increases the 
flexibility of the distribution, makes the z test somewhat less convenient in 
practice than the x’ test. Additional computation is required to obtain the 
values of m; and ne, and to make the continuity corrections. It is also fairly 
laborious to test the significance of the result, if exact values of z at any level of 
significance are required. In these instances, two-way interpolation of recip- 
rocals in the analysis of variance tables is necessary since both mn, and nz are 
always fractional. These difficulties make it desirable toinvestigate the rapidity 
with which the significance levels given by the z test approach those given by the 
x’ test, and thus determine the range of values of m and n for which the simpler 
test can safely be employed. This investigation will yield as a by product the 
05 and .01 significance values of x7 (or W or S) for selected values of m and nas 
determined by the z test. 

Table I presents a summary comparison of the values of x at the .05 and .01 
levels of significance as shown by (1) exact distributions, (2) the z test with 
continuity corrections, (3) the x’ test.* The significance values are expressed in 
terms of x? rather than W because, for a given number of ranks per set (i.e., a 
given n), the significance values given by the x’ test are the same regardless of the 
number of sets of ranks (i.e., of the value of m). This would not be so if W 
were employed, since W = x7/m(n — 1). The expected value of W depends on 


of 3 log. x The difference between the two tests is thus that one, x’, uses 
1 











’ The values of x? computed using the z test that are given in Tables I and II were ob- 
tained with the aid of Fisher and Yates’ Table V [4]. Linear interpolation of reciprocals 
was employed throughout. 
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m and approaches zero as m — « while the expected value of x? is equal to n — 1 
for all values of m. 

The values given by the z test agree remarkably well with the exact values, 
With but two exceptions (the .01 values for n = 3, m = 8 and ‘10) the exact 
value differs very much less from the value given by the z test than from the 
value given by the x’ test. In all but three of the 12 comparisons, the z test 
gives a value below the correct one.’ 


TABLE I 
Comparison of Values of x? at .05 and .01 Levels of Significance Yielded by Exact 
Distributions, z Test with Continuity Corrections, and x? Test 
| .05 Level of Significance .01 Level of Significance 


From Exact | From | From Exact From | 
Distribution | 2 test | Distribution | z test 
bh peri + with | 1 ————_—_——— | with 
in- | conti- | | conti- 
iterpo-| nuity | oe nuity 
lated | correc- | la | correc- 
tions value* | tions 


8 012 


| From 
x? test 
| Limits 

| 


9.00} 8.35 


.991 8.6 - 9.6 | 9.04 8.51 
991) | 9.21 


qo 


999 | 
991 | 


6. 
| 6.004 | 5.991 | 8.67) 8.44 
5. 
5. 


or 


43 
52 
Oo 
.82 


9.21 
9.66 
9.95 
11.34 | 


qgJ3 J J J 





.59 | 9.49 | 9.87-10.13) 10.05) 10.08 
49 | 9.49 | | | 13.28 

















* Computed by linear interpolation of probabilities. 


Table II gives for a very much larger number of values of m and n the .05 
and .01 values of x7 computed on the basis of the z test with continuity correc- 


4 These comparisons duplicate some of those made by Kendall and Smith and merely 
serve to confirm their conclusion that the z test with continuity corrections gives exceed- 
ingly good results. 

The values obtained using the z test without continuity corrections agree less well with 
the exact values than those obtained with the aid of the continuity corrections. However 
even if no continuity corrections are made the z test in general yields values closer to the 
exact values than does the x? test. 
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TABLE II 


Values of xi at .05 and .01 Levels of Significance Computed on the Basis of Kendall 
and Smith’s z test, with Continuity Corrections; .10, .075, .02, .015 Values of x? 





n 


3 | 4 | 5 | 





Values at .05 Level of Significance 





43 
52 
57 | 
63 
67 | 
a | 
74 | 
80 


Nas 


~I 


~J] 


.82 





6.25 | 7 











18 | 6.90 | 8.49 


Values at .01 Level of Significance 

10.08 
21 | 10.93 
66 | 11.42 
95 | 11.74 
31 12.13 
52 | 12.37 
79 =| = 12.67 
.93 12.82 
.26 | 138.19 


34 13.28 15.09 


82 | 9.84 11.67 | 13.39 

















x? (.015)* 8.40 10.46 12.34 | 14.09 15. 


*Computed from Fisher and Yates’ Table IV (4) by linear interpolation between the 
logarithms of the probabilities. 
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tions. The values entered for m = © are obtained from x’ tables for n — 1 
degrees of freedom and are the significance values by the x’ test for all values of 
m. It is apparent that as m increases the .01 and .05 values of x? approach their 
limiting values very rapidly. For n = 7, two-thirds of the difference between 
the .05 values for m = 3 and m = o, and an even larger proportion of the 
difference between the .01 values, disappears by the time m = 10; and the 
situation is similar for the other values of n. Except for the .05 values for n = 3, 
the approach to the limit is monotonic from below. The use of the x’ test thus 
tends to lead to the overestimation of the significance values and of the probabili- 
ties attached to observed values of x7. It is clear, however, that for large and 
even moderate values of m the x’ test is, for all practical purposes, equivalent 
to the z test. 

In order to determine more precisely the range of values of m and n for which 
the approximation given by the x’ test is adequate, it is necessary to adopt some 
convention about the error in estimated significance values of x: that is tolerable. 
Since the conclusion drawn from an observed x; depends on the probability 
that it will be exceeded by chance, this convention clearly should be expressed in 
terms of the error in the probability. 

The structure of published x” tables makes it convenient to accept an estimated 
probability between .10 and .05 as a tolerable approximation to a correct prob- 
ability of .05, and an estimated probability between .02 and .01 as a tolerable 
approximation to a correct probability of .01. These ranges of tolerance are 
entirely on one side of thecorrect probability because, as pointed out above, the 
error in using the x’ test is consistent in direction. These ranges are purely 
arbitrary, of course, and many may think them too broad. 

On the basis of this or some similar convention it is possible to make objective 
statements concerning the range of values of m and n for which the x’ test is 
adequate. The next to the last line in the first section of Table II gives the .10 
values of x’; the next to the last line in the second section, the .02 values. All 
the .05 values of x? shown in the table exceed the .10 value of x’. Using the x 
test, all of the values (with two exceptions for n = 3) would signify a probability 
greater than .05 but less than .10. Thus the error made at the .05 level is 
within the admissible range according to the suggested convention. The x’ 
test is therefore an adequate substitute for the z test at the .05 level for all 
values of m and n except possibly for a few of the values for which exact dis- 
tributions are available. 

As might be expected, the x’ test is less satisfactory at the .01 level. For 
values of m less than six, the .01 values of x7 computed using the z test with 
continuity corrections are less than the .02 value of x’. For m greater than 5, 
the values of x? in the table would all be accorded a probability greater than .01 
but less than .02 if the x’ test were employed. As already noted, this is the range 
of values of m for which the original paper suggested the x” test could validly be 
used [2, p. 695]. 

In view of the arbitrary nature of the convention as to the permissible error 
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in the probability attached to an observed value of x; , it is interesting to in- - 
vestigate the effect of an alternative and stricter convention, namely, that only 
probabilities from .075 to .05 and from .015 to .01 be accepted as approximations 
to correct probabilities of .05 and .01 respectively. The .075 and .015 values of 
x are given in the last lines of the two sections of Table II. On the basis of this 
convention the x’ test is adequate at the .05 level for m greater than three, and 


TABLE III 


Values of S at .05 and .01 Levels of Significance Computed on the Basis of Kendall 
and Smith’s z test, with Continuity Corrections 


‘i Additional values for 
n=3 


4 | 56 | 6 | 7 | 8s 


Values at .05 Level of Significance 


64.4 | 103.9 | 157.3 | 
49.5 | 88.4 | 143.3] 217.0 
62.6 | 112.3 | 182.4] 276.2 | 
75.7 | 186.1 | 221.4] 335.2 
101.7 | 183.7 | 299.0! 453.1 | 
127.8 | 231.2 | 376.7| 571.0 
192.9 | 349.8 | 570.5 | 864.9 
258.0 | 468.5 | 764.4 | 1158.7 





Values at .01 Level of Significance 





75.6 122.8 | 185.6 

61.4 | 109.3 176.2 | 265.0 | 

80.5 | 142.8 229.4 | 343.8 

99.5 | 176.1 282.4 | 422.6 

66.8 | 137.4 | 242.7 388.3 | 579.9 
85.1 | 175.3 | 309.1 494.0 | 737.0 
121.0 | 269.8 | 475.2 758.2 | 1129.5 
177.0 | 364.2 | 641.2 | 1022.2 | 1521.9 








at the .01 level for m greater than nine, except possibly for a few of the values 
for which exact distributions are available. Thus even so drastic a lowering of 
the permissible margin of error as halving it limits only slightly the range of 
values of m for which the x’ test is adequate. 

Table II provides, of course, a direct means of testing the significance of 
observed values of x? for the tabled values of m and n. For this purpose, how- 
ever, Table III, giving the significance values of S is more useful, since it obviates 





92 MILTON FRIEDMAN 


e . . Y° = . ‘ mn . 
the necessity of converting S into x;. For n = 3 Table III includes a few 
values of m in addition to those in Table II. 


SUMMARY 


The preceding analysis suggests that the x° test of the significance of x? 
(or W or 77), while less accurate than the z test proposed by Kendall and Smith, 
is adequate for practical purposes at the .01 level of significance if the number of 
sets of ranks (m) is greater than 5; and at the .05 level for any number of sets of 
ranks, provided the number of ranks in each set (n) is more than 3. Exact 
distributions are now available for n = 3, m = 3 to 10; n = 4, m = 3 to 6; 
n = 5,m = 3[l1]. The .05 and .01 values of x? and S, computed using the 
Kendall and Smith z test with continuity corrections, are given in Tables II 
and III of the present note for n = 3 to 7 and selected values of m from 3 to 100, 
For n greater than 7 and m less than 6, the z test with continuity corrections 
should be employed. For all other combinations of n and m not covered by the 
exact distributions or by Tables II and TII, the x’ test is adequate. 
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NOTES 


This section is devoted to brief research and expository articles, notes on methodology 
and other short items. 


NOTE ON AN APPROXIMATE FORMULA FOR THE SIGNIFICANCE 
LEVELS OF Z 


By W. G. Cocuran 


1. Introduction. An important part has been played in modern statistical 
2 


j sia aie s : 
analysis by the distribution of z = 4 log | , when s} and s3 are two independent 
S2 


estimates of the same variance. In particular, all tests of significance in the 
analysis of variance and in multiple regression problems are based on this 
distribution. Complete tabulation of the frequency distribution of z is a heavy 
task, because the distribution is a two-parameter one, the parameters being the 
number of degrees of freedom, 7; and nz in the estimates sj and s3. Thus each 
significance level of z requires a separate two-way table. Fisher constructed a 
table of the 5 percent points in 1925 [1], and this has since been extended by 
several workers [2] to the 20, 1, and 0.1 percent level for a somewhat wider range 
of values of m and ne. 

With his original table, Fisher gave an approximate formula for the 5 percent 
values of z, for high values of nm, and m2 outside the limits of his table. The 
formula reads: 


(1) z (5 percent) = 16449 _ 0.7843 (; - 1), 
Wh i Mm Ne 


where . = Z + 2. 
h m = M 
The constant 1.6449 is the 5 percent significance level for a single tail of the nor- 
mal distribution, and the constant 0.7843 will be found to be ${2 + (1.6449)’}. 


Thus the general formula for the significance levels of z derivable from (1) is 


p= 8 - (7 +9\(2 - 2) 
Vh—1 6 mM Ne)’ 


where z is a normal deviate with unit standard error. By inserting the appro- 
priate significance level of zx, this formula has been extended [2] to the tables of 
the 20, 1, and 0.1 percent levels of z and commonly appears with all published 
tables of z. The objects of this note are to indicate the derivation of the 
formula and to suggest an improvement upon it in the latter cases. 
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2. The transformation of the z-distribution to normality. For high values 
of m and nz, the distribution of z approaches the normal distribution, the 
principal deviation being a slight skewness introduced by the inequality of n, 
and n.. It is therefore natural to seek an approximate formula for the distri- 
bution of z by examining its relation to the normal distribution. For the 
z-distribution the ratio «,/«2/”, where x, is the r** cumulant, is of the order 
oo. where 7 is the smaller of n, and m2. This property is common to a large 
number of distributions which tend to normality; for example, the distribution 
of the mean of a sample of size n from any distribution with finite cumulants. 
Fisher and Cornish [3] have recently given a method, applicable to all distribu- 
tions with this property, for transforming the distribution to a normal distri- 
bution to any desired order of approximation. They also obtained explicit 
expressions for the significance levels of the original distribution in terms of the 
significance levels of the normal distribution, discussing the z-distribution as a 
particular example. The relation between z and the normal deviate x at the 
same level of probability was found to be 


x 2 1 1 1 fa°+3x ,2°+11z (Z - 
2 =—-} 2){— -— — — {—___— + ——__—__ Al — — —]} >» 
the three terms on the right hand side being respectively of order n?, n™*, and 
n', so that terms of order n” are neglected.’ 

If this equation is compared with equation (1), the latter appears at first 
sight to be the approximation of order n” to the z-distribution, except that the 
divisor of z is ~/h — 1 in (1) and V/h in (2). Computation of a few values 
shows that at the 5 percent level, equation (1) is the better approximation. For 
example, for n, = 40, ne = 60, (1) gives z (5 percent) = .2334, (2) gives .2309, 
and the exact value is .2332. 

Since 

x x x -2 

“= = + + terms of order n 
Vh—-1 WVAh  2hvV/h 
Fisher’s approximation difgers from (2) by including a correction term of order 
n'. Inspection of the true correction terms of this order in equation (2) shows 


3 2 
. 1 , z 
that for finite values of n; and nz the term . + te Sh (2 — 1) is consider- 
144 ny Ne 





3 
ably smaller than the term ~ > 


2h~/h 


, since the former has a smaller numerical 


. ; . 1 1 , 
coefficient and involves the difference between — and —. Thus Fisher’s 
ni Ne 
formula gives a close approximation to the true formula of order n', provided 


3 2 
that 5 is approximately equal to ; Le. if 74S is approximately equal 


1 Fisher and Cornish also gave the two succeeding terms. 
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to 1. For the 5 percent level, s = 1.6449, and os = 0.951. Thus at the 
5 percent level the use of Vh — lin iQ) instead of +/h extends the validity of 
Fisher’s approximation from order n‘ to order n“?. 

This ingenious device, however, requires adjustment at other levels of sig- 
nificance. The values of*(z” + 3)/6 at the principal significance levels are 


shown below. 











—— ance level 











-% | 40 | 30 | 20 | 10 5 F 0.1 
| 


0.55 | 0.62 | 0.77 | 0.95 | 1.40 | 2.09 


h= (2+ 3)/6 | 0.51 | 





“af if Jka h oii Saneiie (1) j is replaced by ~/h — X, —_ the above values of X, 
Fisher’s formula will be approximately valid to order n“ at all levels of signi 
cance. In particular, for the tables already published of the 20, 1 and 0.1 
percent points, \ may be taken as 0.6, 1.4 and 2.1 respectively. The values of z 


~ by, the use of ~/h — 1 and Vi h — i are compared below for n, = 24, 
= 60.” 





Approximate formula 





———— Exact value 


| 
| 
Vi=1 Wh —rx 
| 


Significance Level 





. 1346 1337 . 1338 


20% 
1% .3723 .3748 .3746 
0. 1% -4875 .4966 .4955 








The use of we h — X gives values practically correct to 4 decimal places; 
except for the 0.1 level of significance, at which the higher terms become more 
important. 

With the aid of this formula, complete tabulation of the z-distribution for a 
given pair of high values of m and nz is relatively simple. If very low proba- 
bilities at the tails are required, the further approximations given by Fisher and 
Cornish [3] may be used. 
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? The numerical terms in the approximate formula given for the 20 percent points on p. 28 
of Fisher and Yates’ Statistical Tables are in error. Their formula should read: 
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A NOTE ON THE ANALYSIS OF VARIANCE WITH UNEQUAL CLASS 
FREQUENCIES’ 


B. ABRAHAM WALD’ 


Let us consider p groups of variates and denote by m; (j = 1, --- , p) the 
number of elements in the j-th group. Let 2;; be the 7-th element of the j-th 
group. We assume that z;; is the sum of two variates ¢;; and 7;, i.e. 2;; = 
€:; + n;, where e;; (¢ = 1, --- ,m;;j = 1, --- , p) is normally distributed with 
mean yp and variance o°, and 7; (j = 1, --- , p) is normally distributed with 
mean yp’ and variance o”. All the variates ¢;; and 7; are supposed to be dis- 
tributed independently. 


The intraclass correlation p is given by* 
o” 


. - +o” 
Confidence limits for p have been derived only in case of equal class frequencies, 
i.e. m = mM = --- = m,. In this paper we shall deal with the problem of 


determining the confidence limits for p in the case of unequal class frequencies. 
72 
° e ° ° o ° ° . 
Since p is a monotonic function of —, our problem is solved if we derive confi- 
o 

o” 

dence limits for —. 
oC 


Denote by Z; the arithmetic mean of the j-th group, i.e. 
(1) i; =* + nj. 


m; 


Hence the variance of #; is equal to 


2 
i 2 o 2 
mM; 


72 
Denote — by \*. Then we have 
o 


2 


(3) o:,=0° (2 + »’) =7 


™; j 


1The author is indebted to Professor H. Hotelling for formulating the problem dealt 
with in this paper. 

2 Research under a grant-in-aid from the Carnegie Corporation at New York. 

3See for instance R. A. Fisher, Statistical Methods for Research Workers, 6-th edition, 
p. 228. 
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where 


(4) Wj 


_ mj 
1 + m;d2" 


Now we shall prove that 
p 2 
p W;X; 
w;| # — 7 
i i - 


, ie Wj 


j=1 


l<¢ 
9 
o* j= 


(5) 


has the x‘-distribution with p — 1 degrees of freedom. Let 
Yi = Vu; 8; 
and consider the orthogonal transformation 


aa Lily, ara » Ye); 


a Lpaly, oe Yp), 


Vwi yi + +s + Vp ¥p 
Vwi t +++ + u, 


where Lily, +++, Yp))-++ y»Lpalyi,--+,Yp) denote arbitrary homogenous 
linear functions subject to the only condition that the transformation should 
be orthogonal. 

Since the mean value of y; is equal to ~/w; (u + uw’) and the variance of y; 
is equal to o°, we obviously have: The mean value of y; (Gg =1,---,p— 1) 
is equal to zero, the variance of y; (j = 1, --- , p) is equal to o*. In order to 
prove our statement, we have only to show that the expression (5) is equal to 


1 2 . . ~ } - 
3 (yy? +--+. + y24). If we substitute in (5) a= for Z;, we get 
7 


p 

1 Pp 2 y pe Vw; Yi a Vu; Yi 
" Yi 2 Yi j=l 2 

= 5 . I ° or /W; : ; Pp : + Pa Ss 

Oo” j=1 W; ‘j z. u j 


sos Ly, or , Yo) = 


’ 


ye eI CE vail 
ui 2 dw; — 2, w; 
(do Vw; yi)” 

dw; 


1 i. 
. (yy? + 06 typi). 


Lyi - 
2 
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z;)° 


has the x’ distribution with N — p degrees of freedom, the 


expression 


. _ _ LW; Fj 
(6) | N= peel? Zu; ai 


p—1 22% — 3)? 
has the analysis of variance distribution with p — 1 and N “— p degrees of 
freedom, where N = m +--- + m,. In case m = m, = --- = Mp = M, 
we have 


— > (2; — 2)° 
/ aces a I= ; ie 
(6') ,* >. 7 i a a 1+ m»? 


TV >... adit om 
220i ag pt = N=? mz(z; — 2)" 
p—1 22(x; — 


2_ (F* _ )2 
y= (FE 1). 


If F,; denotes the lower and F, the upper confidence limit of F, we obtain for ’ 
the confidence limits 


where < = 


Hence 


F* 1 F* 1 
§ - 1) 3 — (F " 1) =. 


Let us now consider the general case that m,, --- , m, are arbitrary positive 
integers. First we shall show that the set of values of \’, for which (6) lies 
between its confidence limits F; and F:, is an interval. For this purpose we 
have only to show that 


is monotonically decreasing with \’. In fact 


df(r*) Y. du; € - —_ i» & (= a e (3, _ 
dz 2 an? \ rw; 2 Lu; 2X iN Sw, JY 


Since 
e met) 7 
2, w; (3 - ; = 0, 
we have 
df(.”) _ & du; (2;- —_ _¢< 
dy? -2 dd? =u;)/ 2X 


which proves our statement. 
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Hence the lower confidence limit \{ of \” is given by the root of the equa- 
tion in d*: 


P Sin = 2 
Ys 7 G2; 
(7) F= ==» zw (* ZU; )} = 


= F, 


p—-1l  —— 2X(aij — 4)? 
and the upper confidence limit \3 of \” is given by the root of the equation in )’: 
(8) F = FP,. 


Since f(A”) is monotonically decreasing, the equations (7) and (8) have at 
most one root in \”. If the equation (7) or (8) has no root, the corresponding 
confidence limit has to be put equal to zero. If neither (7) nor (8) has a root, 
we have to reject at least one of the hypotheses: 

(1) vii = €j + 17;- 

(2) The variates ¢e;; and 7; (¢ = 1, --- ,m;;j = 1, --- , p) are normally and 

independently distributed. 

(3) Each of the variates ¢;; has the same distribution. 

(4) Each of the variates 7; has the same distribution. 

The equations (7) and (8) are complicated algebraic equations in \”. For 
the actual calculation of the roots of these equations, well known approximation 
methods can be applied making use also of the fact that the left members are 
monotonic functions of \”. In applying any approximation method it is very 
useful to start with two limits of the root which do not lie far apart. We shall 
give here a method of finding such limits. 

Denote by F the function which we obtain from F (formula (6)) by substi- 
tuting ' 


- l; die 
— 1+5x for w; (j ™ 1, +++, Dp). 
Let f be the function obtained from f by the same process. 
Denote by g(m, d”) the function which we obtain from F by substituting m 
for l,,---,1,. We shall first show that F is non-decreasing with increasing 
aF 


m P); 1.€. al; 


> 0. For this purpose we have only to show that 


of r 
i > ; >have: 
aly 0 We have 


i y (, _ 20,4) _ 
d 


al, > ab, 20, 


ee oe (: — — 

GQ+heye\* 2a) 

Hence our statement is proved. Denote by m’ the smallest and by m’” the 
greatest of the values m,---,m,. Then we obviously have 
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(9) o(m’, N°) < F < o(m", d’). 


12 


Denote by Aj’, Ai”, Az”, Az” the roots in d” of the following equations respectively: 
g(m’, ”) = Fs ; 
g(m”’, ’) = F, } 
g(m’,’) = Fi;  o(m",”) = Fi. 


Since F is monotonically decreasing with increasing \°, on account of (7), (8), 
and (9) we obviously have 


19 


us <u su 


and 
12 


ye <u sc a. 


The above inequalities give us the required limits. 


Co.tumBIA UNIVERSITY, 
New York, N.Y. 


THE DISTRIBUTION OF QUADRATIC FORMS IN NON-CENTRAL 
NORMAL RANDOM VARIABLES 


By Witu1am G. Mapow' 


The following theorem is the algebraic basis of the theorem of R. A. Fisher 
and W. G. Cochran which states necessary and sufficient conditions that a set 
of quadratic forms in normally and independently distributed random variables 
should themselves be independently distributed in x’-distributions.” 

THEeorEeM I. If the real quadratic forms q,,---,Qm, M %,-++,2%n, are 
such that 


(1) ity = Lis, 


and if the rank of q, is ny, then a necessary and sufficient condition that 


(2) . ies 


1 The letters z, 7, u, v will assume all integral values from 1 through n, the letter y will 
assume all integral values from 1 through m, (n > m), the letter @ will assume all integral 
values from n; + --- + n 1 + 1 through ni + --- + n,, (no = 0, m1 + +++ + Mm = 0’); 
the letters 8, 8’ will assume all integral values from 1 through n’, and the letters r, s will 
assume all integral values from 1 through n — 1. 

2 The references are: W. G. Cochran, ‘‘The Distribution of Quadratic Forms in a Normal 
System, with Applications to the Analysis of Covariance,’’ Proc. Camb. Phil. Soc., Vol. 
30 (1934), pp. 178-191, and R. A. Fisher, ‘Applications of ‘Student’s’ Distribution,” 
Metron, Vol. 5 (1926), pp. 90-104. 











DISTRIBUTION OF QUADRATIC FORMS 


where the real linear functions zg of the x, are defined by 

(3) a, = Dy cpez 
' 8 

1s | 


(4) n’ =n. 























Furthermore the system of linear forms (3) constitute an orthogonal transformation. 

Proor: Necessity. Since the rank of a sum of quadratic forms is less than 
or equal to the sum of their ranks, it follows that n’ > n. Upon substituting 
from (3) for the z’s in (1), and using (2), it is seen that, for all values of the z’s, 


yo zs 7” +> * Cy Crp) Zpzp° 
B B,B’ v 
and hence, from (1), it follows that 


(5) z Crp Crp: = Spe 


where dgg, = 0, if 8B ¥ B’, and dg, = 1 if B = B’. However, since the rank 
of the system of linear forms (3) is not greater than n, and since the matrix 
of (5) is the product of the matrix of (3) by its transposed matrix, it follows 
that (5) can be true only if n’ is not greater than n. Consequently n’ = n. 
It then is an immediate result of (5) that the transformation (3) is orthogonal. il 





Sufficiency. We assume that n’ = n. By a real linear transformation of { 
I, +++, 2, We obtain linear forms z, such that ‘ 
qr = pS Caza ’ i 
4 
. where cg = lor —1. The set of linear functions z,, --- , zn are linearly inde- i) 
. ° . hy 
- pendent, for if z, # 0, and if real numbers h,, --- , An; not all zero, exist such 
Bi 
that, say, i 
ure i 
Za. = > hy2r ‘ 
i 
2 
La = >> Heer2e- 
v T,8 

Substituting, we have } 
2 r, v i 
is Da= Dod= DL Hace ys, | 
Y v T,8 py i 
ral 
n'), where z, = ),cz,. (It is not assumed here that the matrix of the c’” is the 
vill “ { 
inverse of the matrix of the c,,. That fact is a consequence of this proof.) i 
mal Denoting the matrix of z;, --- , Zn-1 by Cr we see that the matrix of z dy is i 

Vol. es Y 
n,” CHC, where H is the matrix of the H,, and has rank less than or equal to n — 1 : 
$ 


which contradicts the hypothesis. Hence if C is the matrix having the elements 
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c, in its main diagonal and zeros elsewhere and if C, is the matrix of 2, --- , 2, 
it follows that 


CLC. « i, 


where I is the identity matrix, i.e. the matrix having ones in the main diagonal 
and zeros elsewhere and C, non-singular. Then C = C;"’C;' and hence C is 
the identity matrix and C, is orthogonal. 

Among the hypotheses of the Fisher-Cochran theorem is the hypothesis that 
the mean value of x, is 0, and the variance of x, is o°. However, in connection 
with his analysis of the distribution of the multiple correlation coefficient,’ 
R. A. Fisher derived the distribution of the sum of the squares of n independently 
distributed random variables 2, --- ,2,, the probability density of zx, being 
given by 


(6) p(x,) = (240°)? exp | - = (zt, — «|, 
20° 


More recently, P. C. Tang,’ has used the distribution of the sum of non-central 
squares in his study of the power function of the analysis of variance test. 

In this note we extend the Fisher-Cochran theorem to non-central random 
variables. If the random variables x, are independently distributed with 
probability densities given by (6), Fisher and Tang have shown that if x” = 


1 ne ; se nad 
=, + az, , then the probability density of x” is given by 
o~ » 


bad 2\" 
9 ons 9 = aw (3rx"") 
(x’) = le (2/5 i. 4/2 _ eee 7 
? - » yao v!T(4n + v)’ 


where \ = es Z. a; . 
20? v 


We now give necessary and sufficient conditions that a set of quadratic forms 
in normally. and independently distributed random variables should themselves 
be independently distributed in x’-distributions. 

THEOREM II. Let 2, --- ,2n be independently distributed random variables, 
the random variable x, having probability density (6). Denote > 22 by q, and 


1 2 ; 
denote 5,3 Z. a, by X. Let m1, +--+ ,Gm, be quadratic forms, 
- » 
; ( 
o » aS tthe 
wy 


such that Zz, dy = q, and let the rank of q, be denoted by n,. 
Y 


3 R. A. Fisher, ‘‘The General Sampling Distribution of the Multiple Correlation Coeffi- 
cient,’’ Proc. Royal Soc. of London, (A), Vol. 121 (1928), pp. 654-673. 

4P.C. Tang, ‘‘The Power Function of the Analysis of Variance Tests with Tables and 
Illustrations of their Use,’’. Statistical Research Memoirs, Vol. 2 (1938), pp. 126-149. 
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A necessary and sufficient condition that the quadratic forms ny (x? = - 


Y? 9 
o 


be independently distributed with joint probability density 
(8) p(x’, «++, xm) = IIT p(x), 
7 


where p(x.) is given by (7) with n, and d, in place of n and x, and 


(9) hy = 35 Dal? ana, 
isn’ = Nn. 

Proor. Necessity. Tang’ has shown that the distribution of x” is given 
by (7) and that if the x; have joint distribution (8), then the distribution of 
x t+) + xn, (= x”), is (7) with n’ in place of n. Upon comparing terms, 
we see that n’ = n. 

Sufficiency. By Theorem I there exist n orthogonal linear functions (3) such 
that (2) is true. Then it is easy to see that the random variables z;, --- , Zn 
are independently distributed with a joint probability density 


(10) plz, +++, 2n) = (2n0°) ” exp [—3 D(z, — ay)’, 


where 


> a, = >. a, and a, = 7 Gu 
v v v 


If we set 20°, = b az, then we have, from (7) and (10), that the x, are 
independently distributed with joint probability density (8). It is only neces- 


sary to show that Dae=>> a\;a,a, in order to complete the proof of the 


a BY 


> af) a,a, = » (Do as? ciner) aia; 
id ] wy 


On the other hand, by direct substitution for the z’s we see that 


dy = x z=) (du CyaCra) LpXy 


BY 


theorem. Now 


and hence a{7? = Zz. CuaCya- Since (1) is an orthogonal transformation, 
Qa 


Z as,” Cin Cj» — 7. cz CyaCva) Cip Civ —_ 2 bai da; , 
a BP a a 


where 6a: = 0, if a # 7 and = 1 if a = 7, which completes the proof. 

It is emphasized that the form of \, makes it unnecessary to calculate the 
matrix of g, to determine \, since the values a, need only be substituted for the 
z, in the original expression for g, to determine ), . 


WasuHineTon, D. C. 


5 See 4 p. 140. 





LOUIS OLSHEVSKY 


TWO PROPERTIES OF SUFFICIENT STATISTICS 


By Louis OLSHEVSKY 


The concept of sufficient statistics was introduced by R. A. Fisher in 1922, 
It was refined and extended in 1936 by Neyman and Pearson who gave defini- 
tions of shared sufficient statistics and sufficient sets of algebraically independent 
statistics.. Today the concept plays an important part in the theory of the 
subject. Characterized briefly, a statistic associated with a single or specific 
population parameter is sufficient when no other statistic calculated from the 
same sample sheds any additional light on the value of the parameter. We 
shall prove that sets of sufficient statistics possess certain interconnections so 
that when one set is known every other set with a like number of members and 
linked with the same population parameters is discoverable. 

THEOREM 1. Jf 7,,---, 7m area set of m(m S n) algebraically independent 
sufficient statistics with regard to the parameters 6, , --- , 0, and the probability law 
P(t1,--:,%n|01,---,0,,--- , 61), @ necessary and sufficient condition for the 
sufficiency of any set of m algebraically independent statistics Ti, +++, Tm with 
regard to the same parameters and the same probability distribution is that the T; 
be a set of independent functions of the T; (4,7 = 1, --- ,m). 

Proor: As an adjunct in the demonstration we cite the following tneorem 
due to Neyman.” For a set of algebraically independent statistics T; , --- , T'n 
to be a sufficient set with regard to the parameters 0,, --- , 0,, it is necessary 
and sufficient that in any point of sample space, except perhaps for a set of 
measure zero, it should be possible to present the probability law in the form 
of the product 


P(t1, +++ In| 01, +--+, Og,--+ , 1) 


= P(T1, +--+, Tm| 01, +++ Og) P(t1, ++» Ln 5 Barr, «++ , Hr) 
where p(71,--- , T'm| 61, +--+ , 9) is the probability law of 7,,---, Tm and 
the function ¢ does not depend upon 6), --- , 9,. 

The sufficiency of the condition stated in the hypothesis of Theorem I is now 
immediately evident. For, if p’ and ¢’ refer to the second set of algebraically 
independent statistics and T; = 7;(T;, --- , Tm) where the functions are inde- 
pendent, the relations can be solved for the 7’; in terms of the T; giving 
T; = TT, a Todi p(T: , area Tih, oe » Oa) 


weedy, ..-<  Tad«+ Thi po TDK, + 


(1) 


9.) 2efas +++» Ta) 
ant, =~ a 
; a(T1, «++, Tm) 
p' (a1, +++, tn} Oq41, ©, 1) = P(t, «++, En; Ogu, -++, M1) + TC a 


1See Neyman and Pearson: ‘‘Sufficient Statistics and Uniformly Most Powerful Tests 
of Statistical Hypotheses,’’ Statistical Research Memoirs of the University of London, June 
1936. The notatian of the present paper is taken from this article. 

2See Neyman’s article in the Giornale dell’ Insituto Italiano degli Attuari, Vol. VI, 
No. 4 (1935) as well as the memoir referred to in footnote 1. 





~ 


TWO PROPERTIES OF SUFFICIENT STATISTICS 


+, In| O01, +++ 0g, +++ , %) 
= 9(T, ,.-++5 Ta lh; «>, Oe, ~- >In 5 Ogi1, +++ , Or) 


Proof of the necessity is somewhat more involved. Since the 7; and 7; are 
both sets of algebraically independent statistics with regard to 6, --- , 4, 
equations (1) and (2) are satisfied. They are, in fact, identities when the 
values of 7,,---, 7m and T;,---, 7m in terms of the 2x; are substituted. 
Division of (1) by (2) and multiplication leads to the equation 


p(T, Mei Tm | A, ee » 9g) a ¢' (x1, a Ln; G41, pie 8) 


(3) of (T,, «++, Tel, -++,@) Om, soso Matias, «xv, 0e 


The right side of (3) is free of 0,,--- ,6,. Therefore, in reality the left side 
must be too. If some or all of the parameters 6,, --- , 6, enter formally into 
the left side, we can choose m + 1 sets of values 6}, --- , 0, (¢ = 1, ---,m+1) 
such that each of the m + 1 functions p(T;, ---, Tm|6i,---,6,) + p(T1, 
.-»,7n|6,, +++, 0) differs formally from all of the others. We can, then, 
since each is equal to the right side of (3) which is free of 6, --- , 8,, equate 
any one of these functions to the remaining m in turn. This provides m inde- 
pendent equations whose very existence proves that the 7; are functions of the 
T; and vice versa. 

If none of the parameters 6; , --- , 0, enters formally into the left side of (3), 
P(T1,---, Tm\| 01, +++ , 0) must be of the form p(T, --- , Tm)g(01, --- , 9) 
oad oT, ---, Tal, ---, 0) of the fom 91, --> « Toth, >> 4. 
In this case the original probability law p(a,---,%n|01,---,0,--:, 4) 
contains 6,, --- , 8, only nominally and there can be no talk of any statistics 
designed to estimate these parameters either singly or in combination. 

When m = 1 and the set of algebraically independent statistics reduces to 
one, the single statistic is termed a shared sufficient statistic of the parameters 
6:,---,0,. For this special case, Theorem I can be restated as follows. If 
T is a shared sufficient statistic with regard to the population parameters 
6,, +--+ , 6, and the probability distribution p(a,, --- ,2n| 01, --- , 4, +--+ , 4), 
the necessary and sufficient condition for the sufficiency of any statistic T’ 
with regard to the same parameters and the same probability distribution is 
that T’ be a function of 7. When m and q both equal one, the statistic becomes 
a sufficient statistic in the sense originally defined by Fisher in 1922. 

A physical law is independent of the coordinate system used to express it. 
This fact is taken account of in modern physics through the employment of 
tensors. One might hope for a parallel situation in the relation between suffi- 
cient statistics and the probability law to which they refer. Given any 1 
parameter family of distribution laws p(a1, --- ,2n| 61, --- , 92), the substitu- 


*See the memoir mentioned in footnote 1. 
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tion 0; = 6;(6;, --- , 0) (¢ = 1, --- , 1) leads to the equally valid representation 
of the family 


p’(t1, +++ ,2n| 01, --- , 8) 
= pir, --- ,%n| (01, --+ , 0%), «++ , O01, «++ , 03). 


Is a set of statistics sufficient with respect to the first representation also suffi- 
ciént with respect to the second? The answer is partly in the affirmative and 
is given by the following proposition. 

TuHeoreM II. [If the set of algebraically independent statistics T; , --- , Tm is 
sufficient with regard to the parameters 0,,---,6, and the probability law 
p(t1, +++ ,2n| 01, +++, 09, +++ , 0), tt is also sufficient with regard to 0; , --- ,* 
and any other representation p'(%1,--- ,2n| 6; ,+++,0g,-+++,61) of the same 
probability law provided 6; (i = 1, --- , q) are independent functions of 0: , --- , 0, 
only and 6; (jg =@qt+1,--- 52) are functions of 0941, «++ , 0: only. 

ProoF: The proof of the theorem is obvious. We are given the fact that 
p(x, -e? ae 61, ee » 9, ea , 91) aoe p(T, -++,Tm| 01, ou , Og) (21 , rh ee 
Inj Oot, +++, 61). Since the 6; (¢ = 1,---,g) are functions of 6,, --- & 
only and the 6; (j = q + 1, ---, 1) are functions of 0,41, --- , 6; only, it follows 
that 6; = 0:(01,---,6,) (¢ = 1,---,q) and 0; = 6,(i41,---, 61) (j = 
q+1,---,l). Consequently, 


p' (tr, «++ Bal Gi, +++ 50, <> ht) 
(4) 


= p'(T1, «++, Tm| Or, +++ »0q)-@'(t1, «++ Bn 3 Boer, ** 9A) 


and the theorem is established. 


New York, N. Y. 


NOTE ON THE MOMENTS OF A BINOMIALLY DISTRIBUTED VARIATE 
By W. D. Evans 


J. A. Joseph, has given two interesting triangular arrangements of numbers, 
the second of which is reproduced herewith as Table 1." The successive rows 
in this table are the coefficients in the expansion of x” as a function of the fac- 
torials x“, using the notation of the calculus of finite differences. For example, 

4 


ot = 2 + 62 4 72 + 2, 
where 
a” = a(x — 1)(x — 2)--- («@ —¢ + 1). 


Joseph points out that the coefficients may be used to generate the numbers 
of Laplace. 

1J. A. Joseph, “On the Coefficients of the Expansion of X,’’ Annals of Math. Stat., 
Vol. X (1939), p. 293. 
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A general expression defining any of the coefficients in terms of its place of 


occurrence in Table 1 may be set up. If we denote by F.(r) the number in 
row r and column c of the table, we have 


(1) = vee DY hen (r >). 
1 1 1 1 

This expression is of additional interest since the numbers defined by it are 

likewise the coefficients in the expression of the z-th moment about the origin 

of a binomially distributed variate in terms of the probability of the variate and 

the size of the sample in which it is contained. For example, it may be easily 


TABLE 1 
3 4 


1 
7 
1 25 15 1 


Fi(r) Fo(r) Fart) Far) Felt) «= Fe(r) 


verified that if a is such a variate, p its probability of occurrence, and n the size 
of the sample in which it is contained, 


E(a)’ = np’ + np 
E(a)’ = np’ + 3np* + np 
E(a)* = n@p* + 6np® + 7np® + np 
and so on. 
Ordinarily, computation of the higher moments of a binomially distributed 


variate is a tedious process of repeated differentiation. However, equation (1) 


immediately permits us to generalize the foregoing expressions to give the z-th 
moment of a as follows: 


ki-1 


z—1 z—-i ki 
(2) E(a)* = dnp a ki » Rae x i. 


It will be noted that when c — 1 in equation (1) and 7 in equation (2) are equal 
to zero, the repeated summations vanish to be replaced by the value one. 

By means of equation (2) much of the labor usually involved in expressing 
the z-th moment about the origin of a binomially distributed variate in terms 
of n and p may be avoided. 


Wasuinoton, D. C. 





REPORT OF THE ANNUAL MEETING OF THE INSTITUTE 


The fifth annual meeting of the Institute of Mathematical Statistics was 
held in Philadelphia, Pennsylvania, on December 27 and 28, 1939, in conjunc- 
tion with the meetings of the American Statistical Association, the Econometric 
Society, and the American Sociological Society. The program for the meeting 
was arranged by Professor C. C. Craig. 

On Wednesday morning, December 27, the Institute held a session devoted to 
contributed papers on Statistical Theory and Methodology. Professor P. R. 
Rider, President of the Institute, presided. At that time the following papers 
were presented: 


1. On the unbiased character of certain likelihood-ratio tests when applied to normal 

systems. 
Joseph F. Daly, The Catholic University of America. 

. The product seminvariants of the mean and a central moment in samples. 
C. C. Craig, University of Michigan. 

. A method for minimizing the sum of absolute values of deviations. 
Robert Singleton, Princeton Local Government Survey. 

. On certain criteria for testing the homogeneity of k estimates of variance. 
C. Eisenhart and Frieda S. Swed, University of Wisconsin. 

. Ona test whether two samples are from the same population. 
A. Wald and J. Wolfowitz, Columbia University and Brooklyn, New York. 

. The power functions of certain tests of significance in harmonic analysis and lag cor- 
relation. 
William G. Madow, Washington, D.C. 

. Some theoretical aspects of the use of transformations in the statistical analysis of rep- 
licated experiments. 
W. G. Cochran, Iowa State College. 

. The standard errors of geometric and harmonic types of index numbers. 
Nilan Norris, Hunter College. 

. A study of R. A. Fisher’s z distribution and the related F distribution. 
L. A. Aroian, Hunter College. 

10. A note on the analysis of variance with unequal class frequencies. 
Abraham Wald, Columbia University. 
11. Anapproach to problems involving disproportionate frequencies. 
Burton D. Seeley, U. S. Department of Labor. 
Abstracts of these papers are given at the close of this report. 


Immediately following the session just described, the Institute held its annual 
business meeting. At that time President Rider announced that the newly 
elected officers for the year 1940 are: President, S. S. Wilks, Princeton Uni- 
versity; Vice-Presidents: C. C. Craig, University of Michigan, and A. T. Craig, 
University of Iowa; Secretary-Treasurer: P. R. Rider, Washington University. 

At one o’clock on the same day, members of the Institute and their guests 

108 
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attended the annual luncheon. At the luncheon, Professor B. H. Camp ad- 
dressed the Institute on Non-standard Deviations. 

On Wednesday afternoon, the Institute met jointly with the American Statis- 
tical Association for a program devoted to Lag Effects in Statistics and Eco- 
nomics. Professor J. D. Tamarkin presided and at this time the following 
papers were read: 


1. Lag effects in statistics and related problems. 
A. J. Lotka, Metropolitan Life Insurance Company. 
2. Some methods in the analysis of lag effects. 
H. T. Davis, Northwestern University. 
3. Lag effects in economics. 
Charles F. Roos, Institute of Applied Econometrics, Inc. 


A joint session with the Biometric Section of the American Statistical Associa- 
tion was held on Wednesday evening, Professor George W. Snedecor presiding. 
The papers presented at this session, which dealt with Design and Analysis of 
Replicated Experiments, were the following: 


1. Practical difficulties met in the use of experimental designs. 
A. E. Brandt, Soil Conservation Service. 
2. Factorial design and covariance in the biological assay of vitamin D. 
C.I. Bliss, Sandusky, Ohio. 
3. Combinatorial problems in the design of experiments. 
Gertude M. Cox, Iowa State College. 
4, Experimental trials with balanced incomplete blocks. 
W.J. Youden, Boyce Thompson Institute. 


On Thursday afternoon the Institute held consecutively joint sessions with 
the American Sociological Society and the Econometric Society. At the first of 
these, Professor William F. Ogburn presided and the following program was 
presented : 


1. How the mathematician can help the sociologist. 
Samuel A. Stouffer, University of Chicago. 
2. Some problems of combinations and permutations as they apply to a comprehensive 
classification of social groups. 
George A. Lundberg, Bennington College. 
Discussion: C. C. Craig, University of Michigan. 
Philip M. Houser, U. 8. Bureau of the Census. 


At the second session the topic for discussion was Recent Advances in Business 
Cycle Analysis and these papers were given: 


. Recursive methods in business cycle analysis. 
Merrill M. Flood, Princeton Surveys. 

. An appreciation of some recent mathematical business cycle theories. 
Gerhard Tintner, Iowa State College. 

. The statisticians’ new clothiers. 
Arne Fisher, Western Union Telegraph Company. 


Paut R. Riwer, Secretary. 





ABSTRACTS OF PAPERS 
(Presented on December 27, 1939, at the Philadelphia meeting of the Institute) 


On the Unbiased Character of Certain Likelihood-Ratio Tests when Applied to 
Normal Systems. JosepH F. Daty, The Catholic University of America. 


Consider a random sample of N observations on a set of variates z!, --- , 2%, where 
zi, --- , z* are assumed to be normally distributed about means which are linear functions 
m* = Xbiz’ of the fixed variates z*+!, --- , 2°. Oneissometimes required to decide whether 
the sample tends to contradict the further hypothesis, Ho , that the coefficients b} belonging 
to a certain subset of the fixed variates, say z**!, --- , z*t*, have the specific values bi, : 
Such a situation occurs, for example, in the generalized analysis of variance. In this paper 
it is shown that the Neyman-Pearson method of the ratio of likelihoods yields a test of Hy 
which is (at least locally) unbiased; in other words, this test is less likely to reject Ho when 
the sample is in fact drawn from a normal population in which b* = b}, than when it is drawn 
from a normal population in which the b} are different from but sufficiently close to bi). 
In the special cases k = 1 or h = 1 the proof goes through even without the restriction that 
the true bi be close to b},, a result which is also implicit in the papers by P. C. Tang and 
P. L. Hsu (Stat. Res. Mem. Vol. 2). 

Similarly with respect to the hypothesis H; that the deviations z‘ — =biz’ fall into 
certain mutually independent sets the A-test is at least locally unbiased; and it has the 
additional property that the expected value of any positive integral power of Vx is greater 
when H,; is true than when the sample is drawn from any other normal population. 


The Product Seminvariants of the Mean and a Central Moment in Samples. 
C. C. Craic, The University of Michigan. 


The method used by the author in calculating the product seminvariants of a pair of 
central moments in samples is not adapted without modification to the present problem. 
In the present paper the necessary modification is developed which gives a routine method 
for the calculation of these sampling distribution characteristics. The calculation is a 
little he&vier than in the previous case but the results for the mean and the second, third, 
and fourth central moments are given up to the fourth order except in one case in which the 
weight is13. It is planned to follow this with a further study of the distribution of Fisher’s 
t in samples from a normal population. 


A Method for Minimizing the Sum of Absolute Values of Deviations. RosBErT 
SINGLETON, Princeton Local Government Survey. 


E. C. Rhodes (Philosophical Magazine, May 1930) presented a method for the estimation 
of parameters in a linear regression where it is desired to minimize the sum of absolute 
values of the deviations. In this paper the structure of the deviation surface is analyzed 
and a method of steepest descent is developed which for computational purposes is an 
improvement over Rhodes’ method. The process is finite and leads to an exact solution. 
The method and the formulae used are such as to permit the successive additions of new 
observations or sets of observations to the original data, or the exclusion of an observation 
from the original set, and the determination of the parameters for the sets of data so de- 
rived, with little additional labor. 
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On Certain Criteria for Testing the Homogeneity of k Estimates of Variance. 
C. EISENHART AND FRriepa 8. Swep, University of Wisconsin. 


Given k variance estimates sj , 8} , --- , 8; with n,s?2, (r = 1, 2, --- , k}, independently 
distributed as xo? for n, degrees of freedom, tests of the hypothesis, Ho, that «2 = o’, hi 
(r = 1, 2, --- , k), where o? is unknown, have been based to date on one or the other of the 
quantities 
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where the w, are weights, w = 7 Ur,n= p> nm, , and ns? = yy n,s2. A.E. Brandt and 
y= yo vv 


W. L. Stevens have advocated the use of Q; , referring an observed value of Q; to the x? 
distribution for k — 1 degrees of freedom. J. Neyman, E. S. Pearson, B. L. Welch, and 
M. S. Bartlett have advocated tests based on Q: , Bartlett definitely proposing the use of 
degrees of freedom as weights, i.e. w, = n, , and recent work of E. J. G. Pitman and others 
has shown that unless w, = n, tests basedonQ,arebiased. (Astatistical test of an hypoth- 
esis H is said to be unbiased when the probability of rejecting H by its use is a minimum 
when H is true; obviously a desirable property.) When w,; = n, Bartlett has suggested that 


1 
the distribution of Q2 can be satisfactorily approximated by referring Q:/ + 3(k — 1) 
& 1 
( De 9) to the x? distribution for k — 1degreesof freedom. In this paper we discuss 
r=1 


the adequacy of the x? distribution to describe the distribution of Q: and of the adjusted 
Q: when the degrees of freedom, n, , are small. 

U.S. Nair and D. J. Bishop have given theoretical evidence which suggests that when 
ny > 2, (r = 1, 2, --- , k), Bartlett’s adiusted Q2 may be expected to conform to the x? 
distribution reasonably well in the neighborhood of the 5% and 1% levels. Using 1000 
samples of 4 for which n,s?/(n,4:) has been tabulated by W. A. Shewhart in Table D, Ap- 
pendix II of his ‘‘Economic Control of Quality of Manufactured Product,’’ 200 values of 
Q: and Q2 (with adjustment) were calculated and compared with the x? distribution for 
k — 1 degrees of freedom. Two cases were studied: Case I, k = 5andm = n:, = --+ = 3; 
Case II, k = 3 and m = nz = 3 while n; = 9. As measured by the Chi-Square Goodness of 
Fit Test, using 11 degrees of freedom, the fits were good in all four instances. In Case I, 
for Bartlett’s adjusted Q, the test led to .80 < P < .90, and to.70 < P < .80 for the Brandt- 
Stevens Q, ; in Case II, the fits were poorer with .50 < P < .70 for Bartlett’s criterion and 
10 < P < .20 for the Brandt-Stevens. However, an examination of the descending cumula- 
tive distributions showed that in all instances these criteria exhibited a deficiency of large 
values of x?, with the deficiency, in general, more marked in the case of the Brandt-Stevens 
test. Consequently, when one uses significance levels for these criteria obtained by means 
of the x? approximation advocated, one is in reality using a level of significance slightly " 
less than that professed. The discr-pancy is not great, however, and is on the safe side, i.e. 
one will reject Ho falsely in the long run less often than one professes to be doing. Without 
doubt, however, one will also detect the falsehood of Ho when co? # o? , for at least one pair 
of values of r and t, r ¥ t, less often in the long run by the use of these approximate signifi- 
cance levels than if the true levels were used, but we have no definite evidence at present 
on this point. A somewhat disquieting feature is that the agreement between the x? values 
yielded by the two criteria becomes worse as one proceeds toward larger values of x? in 
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terms of either quantity. Thus, of 8 samples which Q2 would have rejected at the 5% level 
in Case I, only 4 of these would have been rejected by Q: , and Q2 would have passed 3 
samples of the 7 rejected by Q:. Thus it appears that, if one wishes to work with a given 
chance of rejecting Ho falsely, ones ,ould choose one of these criteria and then stick to it in 
future applications. For large values of the n, the two criteria tend to equivalence, so the 
choice between them is of interest mainly for small n, , but cannot be made with full in- 
formation until more is known about the bias, if any, of the Brandt-Stevens test, and the 
relative power of the two tests with regard to alternatives to Ho. 


On a Test Whether Two Samples are from the Same Population. A. Wap 
AND J. WoLFow1Tz, Columbia University and Brooklyn, New York. 


Let X and Y be two independent random variables about whose distributions nothing is 
known except that they are continuous. Let 2, 22, --- , 2m be a set of m independent 
observations on X and let y:, y2, --* , Yn be a set of nm independent observations on Y. 
The null hypothesis to be tested is that the distributions of X and Y are identical. 

Let the set cf m + n observations be arranged in order of magnitude, thus: 21, 22, --+, 
Zm4n- Replace z; by v; (¢ = 1, 2, --- , m+n) where v; = 0 if z; is a member of the set of 
x’sandv; = 1, if z;is a member of the set of y’s. Since the null hypothesis states only that 
the distributions of X and Y are identical without specifying them in any other way, the 
distribution of the statistic U used for testing the null hypothesis must be independent of 
this common distribution of X and Y. It can easily be shown that the statistic U must be 
a function only of the sequence 1) , v2, +++ , Umin- 

A subsequence v5 , Vs41 , *** » Usyr (Where r may also be 0) is called a run if vs = 441 = 

+ = Vs,r and if v5.1 # vs when s < 1 and if vs,- ¥ Vsyr41 Whens +r< m+n. The 
statistic U defined as the number of runs in the sequence 2; , v2, --- , Umyn Seems a Suitable 
statistic for testing the null hypothesis. A difference in the distribution functions of X 
and Y tends to decrease U. Hence the critical region is defined by the inequality U< uo, 
where uo depends only on m, n, and the level of significance adopted. If m < n and 
P{U = ¢} is the probability that U = c, then: 
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m-1 - n-10, m-1C), n-1 Ss 
PiU =2K —1) =| Cri ee aCe) 


P{U = 2K} = (K = 1,2, ---,m), 


(K = 2,3, ---,m+1). 


The mean of U is: 


The variance of U is: 


2mn(2mn “a> n) ; 


(m + n)?(m +n — 1)" 
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If — = a (a positive constant) and m— ~, the distribution of U converges to the normal 
n 


distribution. 


The Distribution of Quadratic Forms In Non-Central Normal Random Vari- 
ables. Witi1AmM G. Mapow, Washington, D. C. (Presented to the Institute 
under a slightly different title) 
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Let the distribution of a sum of non-central squares of normally and independently dis- 
tributed random variables which have the unit variances be called the x’ distribution. 
It is proved that if a set of quadratic forms have a sum which is the sum of the squares of 
their variables, then a necessary and sufficient condition that the quadratic forms be inde- 
pendently distributed in x’? distributions is that the rank of the sum of quadratic forms be 
equal to the sum of the ranks of the quadratic forms. Furthermore, the constants on which 
the x’? distributions depend may be obtained by substituting the values about which the 
variables are taken for the variables themselves in the quadratic forms. Roughly speaking 
the theorem states that if a set of quadratic forms satisfy the conditions of the Fisher- 
Cochran theorem when the true means vanish, then the set of quadratic forms will be 
independently distributed in x’? distributions when the true means do not vanish. 





Some Theoretical Aspects of the Use of Transformations in the Statistical 
Analysis of Replicated Experiments. W.G. Cocuran, Iowa State College. 


The device of transforming the data to a different scale before performing an analysis of 
variance has recently been recommended by a number of writers for replicated experiments 
in which the original datashow a markedly skew distribution. Theuse of transformations 
to obtain an approximate analysis has been supported mainly on the grounds that in the 
transformed scale the true experimental error variance is approximately the same on all 
plots. This paper considers the relation of the method of transformations to a more exact 
analysis. Discussion is confined to the »/z and sin! \/z transformations, which appear 
to receive the most frequent use in practice. 

To obtain an exact analysis, it is necessary to specify (7) how the expected value on any 
plot is obtained from unknown parameters representing the treatment and block (or row 
and column) effects (iz) how the observed values on the plots vary about the expected 
values. If the latter variation follows the Poisson law, (a case to which the square root 
transformation has been considered appropriate), the equations of estimation by maximum 


likelihood take the form 
z—m\odm H 
>( m ) ac . i 


where z is the observed and m the expected value on any plot, c is a typical unknown para- 
meter, and the summation extends over all plots whose expectations involve c. As the 
number of parameters is usually large (e.g. 16 in a 6 x 6 Latin square), these equations are 
laborious to solve; moreover, the question of obtaining small-sample tests of significance is 
difficult. It is shown that if a particular form can be assumed for the prediction formula 
in (i), namely that +/m is a linear function of the treatment and block (or row and column) 
constants, the equations of estimation may be reduced to the simpler form 


(2) ae 4(r’ — /m) = 0, 







(1) 














1 2 ; 
where r’ = 3( vm + =.) is a function closely related tothesquarerootofz. It follows 
a/m 


that the statistical analysis in square roots, with some slight adjustments, coincides with 
the maximum likelihood solution, provided that the above form can be assumed for the 
predictionformula. The appropriateness of this form in practice is briefly considered and a 
“goodness of fit’’ test by xis developed. A numerical example is worked as an illustration 4 
and indicates that a good approximation is obtained by the transformation alone even 
with very smallnumbers per plot. The corresponding theory is also discussed for the inverse F 
sine transformation, which applies where the original data are percentages or fractions t 
whose experimental errors are derived from the binomial distribution. 
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In practice the type of analysis outlined above is unlikely to supplant the simple use of 
transformations, because it can seldom be assumed that the experimental variance ig 
entirely of the Poisson or binomial type. The more exact analysis may, however, be 
useful (i) for cases in which the plot yields are very small integers or the ratios of very 
small integers (ii) in showing how to give proper weight to an occasional zero plot yield. 


The Standard Errors of Geometric and Harmonic Types of Index Numbers. 
By Nizan Norris, Hunter College. 


Various statisticians have made empirical studies of the sampling errors of certain types 
of index numbers used in the United States and England. None of these writers has taken 
advantage of the tools afforded by the modern theory of estimation, including fiducial 
inference, as a means of arriving at direct and general expressions for estimating the stand- 
ard deviations of the sampling errors of geometric and harmonic types of index numbers. 

A known expression for the first-approximation to the variance of a function, as given by 
the relation between the variance of the function and the variance of the argument, is 
valid for that general class of distributions of which the variance and a higher moment 
are finite. With the aid of this relation, there appear simple and useful forms for estimat- 
ing the standard errors of geometric and harmonic types of indexes. For sufficiently large 
samples, these forms are valid for all of the types of distributions of price relatives, produc- 
tion relatives, and similar observations ordinarily encountered, provided that there are 
satisfied the necessary conditions for drawing sound inferences on the basis of sampling 
without reference to the value of the variate. 

Necessary conditions for using tests of significance soundly in connection with index 
number problems are those of realistic and intimate acquaintance with observations, and 
careful attention to certain broad theoretical considerations which determine whether or 
not the index is suited for the purpose for which it is used. 


A Study of R. A. Fisher’s z Distribution and the Related F Distribution. L. A. 
Aro1an, Hunter College. 


The following results for the z distribution and related F distribution are investigated: 

(1) Geometric properties. 

(2) Exact values of the seminvariants and moments of z. Exact values of the first 
four central moments of F. 

(3) The approach to normality of both distributions as n; and nz become large in any 
manner whatever. ; 

(4) The Pearson types of approximating curves, the logarithmic normal approximation, 
the Gram-Charlier approximation, and the uses of these in finding any level of 
significance of z and of F. 


A Note on the Analysis of Variance with Unequal Class Frequencies. ABRAHAM 
Watp, Columbia University. 


Let us consider p groups of variates and denéte by m; (j = 1, --- , p) the number of 
elements in the j-th group. Let 2;; be the i-th element in the j-th group. [We assume that 
24; is the sum of two variates «;; and 7;, i.e. 21; = 4; + 9; where e;; (¢ = 1, ---, mj;j = 
1, --- , p) isnormally distributed with mean yw and variance o?, and n; (j = 1, --- , p) is 
normally distributed with mean yp’ and varianceo’*. All the variates e;; and 7; are supposed 
to be distributed independently. The intra-class correlation p is given by 
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Confidence limits for p have been derived only in case of equal class frequencies, i.e. m: = 
m,= ++: =m,. Wegive here the confidence limits for p in case of unequal class frequen- 


2 
; ’ s es ; 5 ee 
cies. Since p is a monotonic function of = » it is sufficient to derive confidence limits for 
Cg 


12 
Denote — by \? and the arithmetic mean of the j-th group by #;. Let 
og 


mj 


mF + yd?” 


and denote by F: and F; the lower and upper confidence limits respectively of F, where F 
has the analysis of variance distribution with p — 1 and N — p = m+--- +m,-—p 
degreesoffreedom. Then the lower confidence limit \j of \? is given by the root of the equa- 


tion in A?: 
- 2 
S69 2m 
vp eel® Zw; 


(1) f) = ot S2(2q — 2,)" 


= F,, 


and the upper confidence limit Aj of \? is given by the root of 
(2) fQ*) = Fi. 


For calculating the roots of (1) and (2), we can make use of the fact that f(A?) is mono- 
tonically decreasing with increasing \?. 


An Approach to Problems Involving Disproportionate Frequencies. Burton 
D. SEELEY, Washington, D. C. 


Applied mechanics offers an analysis of variance solution to problems of multiple classi- 
fication involving disproportionate sub-class numbers. The quality of orthogonality may 
be attained in such problems by measuring the variability between classes of any one 
classification after centering the others. This approach, which is not limited by the num- 
ber of classes or the number of classifications, treats the problem involving equal sub-class 
numbers as a special phase of the general analysis of variance. 





CONSTITUTION 
OF THE 
INSTITUTE OF MATHEMATICAL STATISTICS 


ARTICLE I 
NAME AND PURPOSE 


1. This organization shall be known as the Institute of Mathematical Statistics. 
2. Its object shall be to promote the interests of mathematical statistics. 


ARTICLE II 
MEMBERSHIP 


1. The membership of the Institute shall consist of Members, Fellows, Honorary 
Members, and Sustaining Members. 

2. Voting members of the Institute shall be (a) the Fellows, and (b) all others who 
have been members for twenty-three months prior to the date of voting. 


ARTICLE III 


Orricers, Boarp oF Directors, COMMITTEE ON MEMBERSHIP, AND COMMITTEE ON 
PUBLICATIONS 


1. The Officers of the Institute shall be a President, two Vice-Presidents, and a Secre- 
tary-Treasurer, elected for a term of one year by a majority ballot at the annual meeting 
of the Institute. Voting may be in person or by mail. 

(a) Exception. The first group of Officers shall be elected by a majority vote of the 
individuals present at the organization meeting, and shall serve until December 31, 1936. 

2. The Board of Directors of the Institute shall consist of the Officers and the previous 
President. 

3. The Institute shall have a Committee on Membership composed of three Fellows. 
At their first meeting subsequent to the adoption of this Constitution, the Board of 
Directors shall elect three members as Fellows to serve as the Committee on Membership, 
one member of the Committee for a term of one year, another for a term of two years, 
and another for a term of three years. Thereafter the Board of Directors shall elect 
from among the Fellows one member annually at their first meeting after their election 
for a term of three years. The president shall designate one of the Vice-Presidents as 
Chairman of this Committee. 

4. The Institute shall have a Committee on Publications composed of three Members 
or Fellows elected by the Board of Directors. The President shall designate a Vice- 
President as Ex Officio Chairman of this Committee. 


ARTICLE IV 


MEETINGS 


1. A meeting for the presentation and discussion of papers, for the election of Officers, 
and for the transaction of othér business of the Institute shall be held annually at such 
time as the Board of Directors may designate. Additional meetings may be called from 
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time to time by the Board of Directors and shall be called at any time by the President 
upon written request from ten Fellows. Notice of the time and place of meeting shall be 
given to the membership by the Secretary-Treasurer at least thirty days prior to the 
date set for the meeting. All meetings except executive sessions shall be open to the 
public. Only papers accepted by a Program Committee appointed by the President may 
be presented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their election and 
again immediately before the expiration of their term. Other meetings of the Board 
may be held from time to time at the call of the President or any two members of the 
Board. Notice of each meeting of the Board, other than the two regular meetings, 
together with a statement of the business to be brought before the meeting, must be 
given to the members of the Board by the Secretary-Treasurer at least five days prior to 
the date set therefor. Should other business be passed upon, any member of the Board 
shall have the right to reopen the question at the next meeting. 

3. The Committee on Membership shall hold a meeting immediately after the annual 
meeting of the Institute. Further meetings of the Committee may be held from time to 
time at the call of the Chairman or any member of the Committee provided notice of such 
call and the purpose of the meeting is given to the members of the Committee by the 
Secretary-Treasurer at least five days before the date set therefor. Should other business } 
be passed upon, any member of the Committee shall have the right to reopen the ques- 
tion at the next meeting. 

4. At a regularly convened meeting of the Board of Directors, three members shall 
constitute a quorum. At a regularly convened meeting of the Committee on Member- 
ship, two members shall constitute a quorum. 


ARTICLE V 


PUBLICATIONS 


1. The Annals of Mathematical Statistics shall be the Official Journal for the Institute- 
Other publications may be originated by the Board of Directors as occasion arises. 


ARTICLE VI 
EXPULSION OR SUSPENSION 


1. Except for non-payment of dues, no one shall be expelled or suspended except by 
action of the Board of Directors with not more than one negative vote. 


ARTICLE VII 
AMENDMENTS 


1. This constitution may be amended by an affirmative two-thirds vote at any regu- 
larly convened meeting of the Institute provided notice of such proposed amendment 
shall have been sent to each voting member by the Secretary-Treasurer at least thirty 
days before the date of the meeting at which the proposal is to be acted upon. Voting 
may be in person or by mail. 
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ARTICLE I 
Duties oF THE OFrFiceRS, BoarD oF Dr1rEcToRS, COMMITTEE ON MEMBERSHIP, AND 
CoMMITTEE ON PUBLICATIONS 

1. The President, or in his absence, one of the Vice-Presidents, or in the absence of the 
President and both Vice-Presidents, a Fellow selected by vote of the Fellows present, 
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shall preside at the meetings of the Institute and of the Board of Directors. At meetings 
of the Institute, the presiding officer shall vote only in the case of a tie, but at meetings 
of the Board of Directors he may vote in all cases. At least three months before the date 
of the annual meeting, the President shall appoint a Nominating Committee of three 
members. It shall be the duty of the Nominating Committee to make nominations for 
Officers to be elected at the annual meeting and the Secretary-Treasurer shall notify all 
voting members at least thirty days before the annual meeting. Additional nomina- 
tions may be submitted in writing, if signed by at least ten Fellows of the Institute, up to 
the time of the meeting. 

2. The Secretary-Treasurer shall keep a full and accurate record of the proceedings 
at the meetings of the Institute and of the Board of Directors, send out calls for said 
meetings and, with the approval of the President and the Board, carry on the corre- 
spondence of the Institute. Subject to the direction of the Board, he shall have charge 
of the archives and other tangible and intangible property of the Institute. He shall 
send out calls for annual dues and acknowledge receipt of same; pay all bills approved 
by the President for expenditures authorized by the Board or the Institute; keep a 
detailed account of all receipts and expenditures, prepare a financial statement at the 
end of each year and present an abstract of the same at the annual meeting of the Insti- 
tute after it has been audited by a Member or Fellow of the Institute appointed by the 
President as Auditor. The Auditor shall report to the President. 

3. The Board of Directors shall have charge of the funds and of the affairs of the 
Institute, with the exception of those affairs specifically assigned to the President or to 
the Committee on Membership. The Board shall have authority to fill all vacancies 
ad interim, occurring among the Officers, Board of Directors, or in any of the Committees. 
The Board may appoint such other committees as may be required from time to time 
to carry on the affairs of the Institute. 

4. The Committee on Membership shall prepare and make available through the 
Secretary-Treasurer an announcement indicating the qualifications requisite for the 
different grades of membership. 

5. The Committee on Publications, under the general supervision of the Board of 
Directors, shall have charge of all matters connected with the publications of the Insti- 
tute, and of all books, pamphlets, manuscripts and other literary or scientific material 
collected by the Institute. Once a year this Committee shall cause to be printed in the 
Official Journal the Constitution and By-Laws and a classified list of all the Members 
and Fellows of the Institute. 


ARTICLE II 
DvES 


1. Members shall pay five dollars at the time of admission to membership and shall 
receive the full current volume of the Official Journal. Thereafter, Members shall pay 
five dollars annual dues. The annual dues of Fellows shall be five dollars. The annual 
dues of Sustaining Members shall be fifty dollars. Honorary Members shall be exempt 
from all dues. 

2. Annual dues shall be payable on the first day of January of each year. 

3. The annual dues of a Fellow or Member include a subscription to the Official 
Journal. The annual dues of a Sustaining Member include two subscriptions to the 
Official Journal. 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone whose dues 
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may be six months in arrears, and to accompany such notice by a copy of this Article. 
If such person fail to pay such dues within three months from the date of mailing such 
notice, the Secretary-Treasurer shall report the delinquent one to the Board of Directors, 
by whom the person’s name may be stricken from the rolls and ali privileges of member- 


ship withdrawn. Such person may, however, be re-instated by the Board of Directors 
upon payment of the arrears of dues. 


ARTICLE III 
SALARIES 


1. The Institute shall not pay a salary to any Officer, Director, or member of any 
committee. 


ARTICLE IV 


AMENDMENTS 


1. These By-Laws may be amended in the same manner as the Constitution or by a 
majority vote at any regularly convened meeting of the Institute, if the proposed amend- 
ment has been previously approved by the Board of Directors. 





DIRECTORY OF THE INSTITUTE OF MATHEMATICAL STATISTICS 


(As of January 1, 1940) 


Please notify the Secretary, P. R. Rider, Washington University, 
St. Louis, Mo., of any changes of addresses or errors in names, titles or 
addresses as printed. 


Anderson, Mr. Paul H. 108 E. Chalmers St., Champaign, III. 

Acerboni, Dr. Argentino V. Larroque 232 Banfield, Buenos Aires, Argentina. 

Alter, Prof. Dinsmore. Director of Griffith Observatory, Los Angeles, Calif. 

Arnold, Prof. Herbert E. Dept. of Math., Wesleyan Univ., Middletown, Conn. 

Aroian, Prof. Leo. Dept. of Math., Hunter College, Lexington Ave. and E. 68th St., 
New York, N. Y. 

Bacon, Prof.H.M. Box 1144, Stanford Univ., Calif. 

Baker, Dr. G. A. Experiment Station, College of Agriculture, Univ. of Calif., Davis, 
Calif.. 

Barral-Souto, Dr. José. Cordoba 1459, Buenos Aires, Argentina. 

Barrett, Mr. C. S. 3145 Maple Ave., Brookfield, Ill. 
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